Lập trình Apache Hadoop

Overview:

The 4-day Apache Hadoop Developer program will not only provide you with the foundation and concepts in your understanding of Big Data and the all the associated technologies in Apache Hadoop and the Hadoop Ecosystem but will also provide you with hands-on lab exercises that will equip you with the knowledge to write‚ maintain and optimize Big Data projects and Hadoop jobs.

Duration:  04 days (32 hours)
Intended Audience:

-      IT Developers and Engineers who wish to write‚ maintain and/or optimize Apache Hadoop or Big Data projects.

Prerequisites:

-      Basic database knowledge

-      Prior knowledge in Java is recommended but experience with Python‚ PHO or C# is sufficient

Course outlines:

1.      Introduction to Big Data and Hadoop (Day 1)

  • Introduction to Big Data
  • Hadoop Overview
  • Hadoop Basic Concepts
  • Writing MapReduce Applications
  • Lab Exercises

2.      Introduction to Hadoop API and MapReduce (Day 2)

  • Reducers and Partitioners
  • Hadoop API
  • Unit Testing
  • Optimizing MapReduce Jobs
  • Lab Exercises

3.      MapReduce Algorithms and Advanced Features (Day 3)

  • Input and Output Formats
  • Common MapReduce Algorithms
  • Advanced MapReduce Feature
  • Lab Exercises

4.      Overview of the Hadoop Ecosystem Technologies and Tools (Day 4)

  • Sqoop and Flume Overview
  • HBase Overview
  • Hive and Pig Overview
  • Oozie Workflow Overview
  • Appendix
  • Lab Exercises
  • Học trực tuyến

  • Học tại Hồ Chí Minh

  • Học tại Hà Nội


Các khóa học khác