I'm trying to evaluate the differences between these two options. Here are some pros and cons I can think of :
Elastic Map Reduce => Better support from Amazon, No need to administer cluster, More Expensive (?)
EC2 + Hadoop => More control of your hadoop configuration, Cheaper (?)
I'm wondering if anyone might have benchmarked the performance of EC2 + Hadoop vis a vis EMR? Is there any significant difference in cost for large cluster deployments? What other differences exist?
See Question&Answers more detail:
os 与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…