Project (Apache Spark, Matplotlib, pyspark) | Project (Star Schema, Apache Spark, AWS S3, and Python) This solution applies star schema model by- extracting JSON data files on AWS S3 to DataFrames of Apache Spark (PySpark),
- transforming raw data to star schema by using Apache Spark, and
- loading these dataframes of star schema as Parquet files (column-oriented storage format) on AWS S3. </ul> </td> </tr>
NoSQL Apache Cassandra | Project (Star Schema, AWS Redshift, AWS S3, and Python) | Project Neo4J (REST, SparkJava, Neo4J Graph DB, Cypher QL, Java)
|
|