Until recently, most companies used the traditional approach for storing all the company’s data in a Data Warehouse. The internet growth caused an increase in the number of data sources and the massive quantities of data to be stored, requiring scaling these Data Warehouses constantly. They were not designed to handle petabytes of data, so companies wereContinue reading “Querying our Data Lake in S3 using Zeppelin and Spark SQL”