Big Data Hadoop Training
Big Data comes up constantly in discussions of Industry 4.0. But what is Big Data? How will it impact the industry? What opportunities will it bring? How can one learn and practice Big Data? Can any graduate opt for Big Data training?
Join us to discuss and learn Big Data. Recro India is organizing instructor-led online virtual sessions, which will focus not only on the theoretical but also on the practical aspects of Big Data.
At the end of the training, decide for yourself whether Big Data is good news, bad news or fake news!
Training started on 17th Aug 2019 at 1330 hours (GMT)
Repeat sessions for 17th Aug and 18th Aug are planned for 24th Aug and 25th Aug, respectively.
Total number of ONLINE sessions – 10 (Ten sessions), each of 4 hours duration (every Saturday and Sunday)
Fees (incl. of all taxes) for 10 sessions –
INR 17,200 (for Indian residents)
~ USD 250 (for non-Indian residents)
Please use the following button to complete your payment and register for the course – PayPal Payment Link
DIGITAL LEARNING – Learn on the GO!
Instructor-led online virtual sessions are delivered through the Zoom Meetings application. The live session can be attended on a mobile, tablet, desktop or laptop; all you need to do is log in to the Zoom session at the scheduled time.
Train yourself while travelling, at home, on vacation or at the office.
All sessions will be recorded by your instructor, and the videos will be made accessible to you.
Participate in the interactive live online classes and get your doubts clarified instantly.
Requirements – Good internet connectivity and headphones with a mic.
Every Saturday and Sunday at 1330 hours (GMT), starting from 17th Aug 2019. Each day’s session shall be for 4 hours. There shall be 10 sessions in total.
- Understand the Ecosystem of Hadoop
- Know-how of Machine Learning
- How to manage and make sense of big data sets
- Familiarization with Cloud Computing
- Using Big data and machine learning together
- Get hands-on with R and Python
Prerequisites: Basic programming skills and familiarity with basic concepts of Statistics and Mathematics
Who should take this course:
- Software Developers and Architects
- Senior IT professionals
- Testing and Mainframe Professionals
- Data Management Professionals
- Business Intelligence Professionals
- Project Managers
- Aspiring Data Scientists
- Graduates looking to build a career in Big Data Analytics
- Industry readiness on key concepts of Big Data
- Real-life case study
- Ability to deep dive with large data sets
- Potential to create business value using data inferences
WEEK 1:
- Theory
- Introduction to Big Data
- Introduction to Hadoop
- Hadoop Ecosystem and Architecture
- Introduction to HDFS
- Deep Dive: HDFS
- Introduction to Hive
- Practical
- Installing Oracle VirtualBox
- Installing Cloudera Sandbox
- Ambari Usage
- Storing a large file in HDFS
- Querying this large file using Hive
WEEK 2:
- Theory
- Introduction to MapReduce
- Deep Dive: MapReduce
- Introduction to NoSQL
- Practical
- Implementing a wordCount Program (MR)
- Implementing a wordCount Program (PySpark)
- Installing MongoDB
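To give a flavour of the Week 2 word-count practical, the idea can be sketched in plain Python. This is only an illustration of the map, shuffle and reduce phases, not the source code used in the course; the function names and the sample text are assumptions made for this sketch, and it runs without any Hadoop installation.

```python
from collections import defaultdict


def map_phase(lines):
    """Emit a (word, 1) pair for every word in every input line."""
    for line in lines:
        for word in line.lower().split():
            yield word, 1


def shuffle_phase(pairs):
    """Group emitted values by key, as a framework does between map and reduce."""
    grouped = defaultdict(list)
    for word, count in pairs:
        grouped[word].append(count)
    return grouped


def reduce_phase(grouped):
    """Sum the grouped counts for each word."""
    return {word: sum(counts) for word, counts in grouped.items()}


def word_count(lines):
    """Chain the three phases on an in-memory list of lines."""
    return reduce_phase(shuffle_phase(map_phase(lines)))


if __name__ == "__main__":
    # Hypothetical sample input, standing in for a file stored in HDFS.
    sample = ["big data is big", "data is everywhere"]
    print(word_count(sample))
```

On a real cluster the map and reduce steps would run in parallel across nodes and the grouping would be handled by the framework; here everything runs in a single process to keep the phases easy to follow.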
WEEK 3:
- Theory
- Introduction to Machine learning
- Deep Dive: Machine Learning
- Deep Dive: NoSQL
- Revision, Recap and QnA
- R Programming
- Practical
- Implementing Machine Learning Classifier using Python
- Operations on MongoDB using Python
WEEK 4:
- Theory
- Deep Dive: Hive
- Introduction to Cloud Computing
- Deep Dive: Cloud Computing
- Key Cloud Platforms
- Deep Dive: Pig Scripts
- Deep Dive: Oozie
- Practical
- Querying Hive
- Tour of various cloud platforms
- Implementing a wordCount Program (Pig)
- Understanding Oozie and Zookeeper using wordCount Program
WEEK 5:
- Theory
- Deep Dive: Zookeeper
- Deep Dive: Spark
- Revision, Recap and QnA
- Spark + Machine Learning Usecase
- Overview of Big Data Stores
- Practical
- Implementing a wordCount Program (Spark)
- Implementing Machine Learning using PySpark
During the training, the course instructor will spend a lot of time on practicals. The following software (among others) will be used to provide hands-on experience during the course –
- Oracle VirtualBox
- Cloudera Sandbox
- HDFS
- Hive
- Spark, PySpark
- MongoDB
- Python
- Oozie
- Zookeeper
- Pig
- R
- Source Code used during practicals
- Data files wherever applicable
- URLs of different cloud-based applications
- Training playbook
A soft copy of the 40-hour Training Participation Certificate from Recro India will be issued to students who attend 40 hours of live online sessions.
Our instructor has been involved in training and consultancy assignments across various domains, with the aim of delivering the potential of Artificial Intelligence, Data Sciences and Big Data to make the journey from data to decisions much more exciting and of greater value.
- Working professional in the domain of data science
- Has successfully delivered more than 800 hours of training with a very high customer satisfaction rate
- Currently working on Artificial Intelligence projects involving large volumes of data
- Developed online training content for engineering graduates, to enhance their employability and billing, for a large Fortune 100 IT company
- Neutral accent
INR 17,200 (equivalent to USD 250) per participant for 10 live online sessions.
Payment will be collected through the secure PayPal payment gateway.
A few pieces of feedback and testimonials are reproduced below. For lack of space, we have not been able to display all of them.