Exploring Data Integration and OLAP Queries with Spark

·Dec 04, 2020 04:11 PM

Yong Tang. Great set of new references for me to look at, thank you! I’m on a similar quest… for data integration and OLAP type queries, powered by Spark over data in a data lake.

https://github.com/SANSA-Stack/SANSA-Stack is powered by Spark RDDs. This seems active but built on the older Spark 1.x RDD structure
OnTop VKG promises to map Sparql to SQL (thus query Big Data sql engines like Spark SQL)
S2RDF and some other papers on optimizing Sparql queries over Spark

👍2

Exploring Data Integration and OLAP Queries with Spark

7 comments

Exploring Data Integration and OLAP Queries with Spark

7 comments