Apr 4, 2018

Apache Spark: An Engine for Large-Scale Data Processing

Apache Spark has become the engine to enhance many of the capabilities of the ever-present Apache Hadoop environment. 

For Big Data, Apache Spark meets a lot of needs and runs natively on Apache Hadoop’s YARN. By running Apache Spark in your Apache Hadoop environment, you gain all the security, governance, and scalability inherent to that platform. Apache Spark is also extremely well integrated with Apache Hive and gains access to all your Apache Hadoop tables utilizing integrated security.