Saurav Agarwal
Distributed Data Science using NVTabular on Spark & Dask
NVTabular — Why, What, and How?
7 min read · Feb 18, 2022

Saurav Agarwal
Bitcoin Price Prediction using RAPIDS: cuDF, DLPack and Keras on GPU
6 min read · May 17, 2021

Saurav Agarwal
Too Small Data — Solving Small Files issue using Spark
I am pretty sure you have all come across the small files issue while working with Big Data frameworks like Spark, Hive, etc.
2 min read · Dec 25, 2020

Saurav Agarwal in DataDrivenInvestor
Data Masking in Big Data [Spark]
We often face challenges masking data in our Big Data pipelines so that all sensitive data is hidden from unauthorized users…
3 min read · Aug 8, 2019

Saurav Agarwal in DataDrivenInvestor
Handling complexity in Big Data — Process nested json with changing schema tags
You may have seen various cases of reading JSON data, ranging from nested structures to JSON with a corrupt structure. But let's see how do…
4 min read · Nov 3, 2018

Saurav Agarwal
NoSQL Simplified
Basics of NoSQL: NoSQL eliminates the need for a schema, pushing data handling capacity up by a huge margin by compromising on ACID…
3 min read · Sep 22, 2018