
Job Scheduling in Apache Hadoop

(guest blog post by Matei Zaharia)

When Apache Hadoop started out, it was designed mainly for running large batch jobs such as web indexing and log mining. Users submitted jobs to a queue, and the cluster ran them in order. However, as organizations placed more data in their Hadoop clusters and developed more computations they wanted to run, another use case became attractive: sharing a MapReduce cluster between multiple users.


The post Job Scheduling in Apache Hadoop appeared first on Cloudera Engineering Blog.

