Categories Posted inin CodePython Introduction to Pandas DataFrames in Python Posted byby Stian Skotland January 5, 2025 Pandas are one of the most powerful and flexible libraries in Python for data analysis and manipulation. I would dare to say it’s an... Continue Reading
Categories Posted inin CodeData Platform Partitioning Tables in BigQuery Posted byby Stian Skotland December 19, 2024 If you are analyzing relatively small amounts of data, then your queries won’t cost very much, but if you regularly analyze huge datasets, then... Continue Reading
Categories Posted inin CodeSQL Mastering and Understanding the Join Function in SQL Posted byby Stian Skotland December 18, 2024 Joins are one of the most frequently used functions in SQL, and that is for good reason. They play an important role in relational... Continue Reading
Categories Posted inin Data PlatformGoogle BigQuery Generating Dates Between Two Dates in BigQuery Posted byby Stian Skotland December 15, 2024 Generating a list of dates between two dates can be very helpful in many situations. Many source systems typically provide you with a start... Continue Reading
Categories Posted inin Data PlatformGoogle BigQuery Incremental Load of Delta Records Using Dataform in BigQuery Posted byby Stian Skotland December 12, 2024 When working with large datasets in data pipelines, processing every record repeatedly can be inefficient. A common approach to optimize this process is to... Continue Reading