Apache Spark and Zeppelin
Apache Zeppelin is an open-source, web-based “notebook” that enables interactive data analytics and collaborative documents. The notebook integrates with distributed, general-purpose data processing systems such as Apache Spark (large-scale data processing), Apache Flink (stream processing), and many others. With Apache Zeppelin you can build data-driven, interactive documents in SQL, Scala, R, or Python right in your browser.
Zeppelin can ingest data through Hive, HBase, and the other interpreters it provides.
For data discovery, Zeppelin provides Postgres, HAWQ, Spark SQL, and other interpreters; with Spark SQL, for example, you can explore data using SQL queries.
Spark, Flink, R, Python, and other useful tools are already available in Zeppelin, and you can extend its functionality by adding a new interpreter.
Data Visualization and Collaboration
The basic chart types (bar, pie, area, line, and scatter) are all available in Zeppelin.
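As a rough illustration of how data reaches those charts: Zeppelin's display system renders paragraph output that starts with %table as a table, reading the first row as the header, tabs as column separators, and newlines as row separators. The chart types can then be chosen for that table. The sketch below builds such a string in plain Python; the helper name to_zeppelin_table and the sample rows are hypothetical and not taken from FileGPS.

```python
def to_zeppelin_table(header, rows):
    """Format rows as Zeppelin %table output (tab-separated, one row per line)."""
    lines = ["\t".join(str(cell) for cell in header)]
    for row in rows:
        lines.append("\t".join(str(cell) for cell in row))
    return "%table " + "\n".join(lines)


# Hypothetical sample data, for illustration only.
table = to_zeppelin_table(
    ["status", "files"],
    [["delivered", 120], ["failed", 7]],
)

# Printed from a Python paragraph in Zeppelin, this output should render
# as a table with the built-in chart options.
print(table)
```

Outside Zeppelin the script simply prints the tab-separated text, which makes the format easy to check before pasting it into a notebook.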
In FileGPS, we use the Spark Streaming component, integrated with Kafka, for data computation.
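To give a feel for what a Spark and Kafka integration can look like, here is a minimal PySpark sketch. It is not the FileGPS implementation: it uses Spark's Structured Streaming Kafka source rather than the older DStream-based Spark Streaming API, and the broker address, the topic name file-events, and both helper functions are assumptions made for illustration. It also assumes the Spark Kafka connector package is available on the cluster.

```python
def kafka_source_options(bootstrap_servers, topic, starting_offsets="latest"):
    """Build the option map for Spark's Kafka source."""
    return {
        "kafka.bootstrap.servers": bootstrap_servers,
        "subscribe": topic,
        "startingOffsets": starting_offsets,
    }


def count_events_by_key(spark, options):
    """Return a streaming DataFrame holding a running record count per Kafka key.

    `spark` is an existing SparkSession, such as the one Zeppelin's Spark
    interpreter exposes.
    """
    records = spark.readStream.format("kafka").options(**options).load()
    # Kafka keys arrive as binary, so cast them to strings before grouping.
    keyed = records.selectExpr("CAST(key AS STRING) AS key")
    return keyed.groupBy("key").count()


# Hypothetical broker and topic; replace with real values.
opts = kafka_source_options("localhost:9092", "file-events")
```

With a live session, the query could be started with count_events_by_key(spark, opts).writeStream.outputMode("complete").format("console").start(), which prints the updated counts after each micro-batch.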
Apache Spark Streaming