Jun 10, 2015 11:00 PMLocation:
Constant Contact Reservoir Place
1601 Trapelo Road Waltham MA 021451
Apache Hadoop is a powerful and sometimes complex tool for dealing with Big Data as well as high data throughput applications which can enable some existing applications to finally run right as well as open doors for entirely new types of applications and analysis. So the question is how does one get started with Hadoop? This presentation explores the various introductory aspects of the Hadoop infrastructure, data sources and query strategies and planning so you can get started with Hadoop.
Through this introductory no non-sense presentation we will explore various environmental options to design your initial cluster; such as physical vs virtual environments. In addition, we will explore various data ingestion and modeling strategies so you can populate your new cluster with the data required for your analysis in an Agile way. Finally, we will review various strategies available to process and query your data so you can get value from the cluster.