Category: Big Data

Taming the Snowflake: Strategies to Slash Your Cloud Data Warehouse Costs

My experience with Snowflake spans several organizational contexts. Its power to scale with business data requirements is matched by how readily its costs scale too. This post explores practical strategies to help you reduce your Snowflake costs without sacrificing performance, starting with general guidance and then moving into technical strategies, including dbt-specific packages and approaches.

RLIKE in Hive: Filtering with regular expressions

Having previously written a blog post on querying SQL Server with something LIKE a regular expression (Using simple regular expressions in a LIKE statement), I thought I would use that as a segue into Apache Hive and HiveQL. Because Hive, and therefore HiveQL, is built using Java, it has the full power of Java regular…

Dates In Hive

After previously spending a lot of time looking at dates in SQL Server and Oracle, I wanted to do the same justice to Hive.

Getting LEFT & RIGHT in Hive

Hive doesn’t have the LEFT() and RIGHT() functions found in other RDBMS/SQL platforms. Here I demonstrate how to emulate LEFT and RIGHT using Hive’s text-parsing functions.