• Training
  • ACT82003 Alibaba Cloud E-MapReduce
ACT82003 Alibaba Cloud E-MapReduce
  • Training
  • ACT82003 Alibaba Cloud E-MapReduce
ACT82003 Alibaba Cloud E-MapReduce
Class at a Glance
  • Training Number
    ACT82003
  • Type
    Offline classroom training
  • Class Duration
    1 Day(7 Hours)
  • Available Languages
    English, Chinese
  • Hands-on Labs
    2 Hands-on Labs
Introduction
In this course, you will learn how to use Alibaba Cloud’s E-MapReduce (EMR), a managed Hadoop cluster service. Learn how to work with EMR to create, configure, manage, and scale Hadoop clusters on Alibaba Cloud, allowing you to store and process huge datasets both offline and in real time.
Recommended For
Data Developers and Engineers with Hadoop experience, who are looking at migrating workloads to Alibaba Cloud.
Highlight
  • Understand the advantages (cost, performance, resilience) of EMR over self-built Hadoop
  • See best practices and use-cases for EMR, and learn how data can be migrated to the cloud
  • Get hands-on experience creating EMR clusters and running jobs in Hive
Training Outline
  • Hadoop and principles of distributed systems
  • Advantages of EMR over self-built Hadoop
  • Data import and data development using Sqoop and Hive
  • In-depth introduction to Spark (and PySpark)
  • Overview of advanced features: auto scaling, using OSS buckets to replace HDFS, and more
  • Hands-on labs focusing on data processing with Hive and Spark
Upcoming Training Classes
Currently no classes available