Pratik BarjatiyaExploring Strongly Connected Components and the Kosaraju AlgorithmIn the field of graph theory, understanding the connectivity and structure of networks is of paramount importance. Strongly Connected…Jun 3, 2023Jun 3, 2023
InData And BeyondbyPratik BarjatiyaExploring the Apache ORC File Format: Advantages, Use Cases, and Best Practices for Data Storage…An ORC (Optimized Row Columnar) file is a high-performance data storage format designed for Hadoop and other big data processing systems…Jan 23, 20231Jan 23, 20231
InData And BeyondbyPratik BarjatiyaOverview of Parquet and Why It Gels with PySparkParquet is a columnar storage format for big data. It is a self-describing format that allows for efficient compression and encoding…Jan 22, 2023Jan 22, 2023