Kostas Pardalis

Posts

  • A Tutorial on SQL Window Functions Using DuckDB.

    DuckDB provides 14 SQL window-related functions in addition to all the aggregation functions that can be combined with windows. Snowflake, on the other hand, offers more than 70 functions that can be used with SQL windows. PostgreSQL also supports 11 SQL window-related functions, as well as all the aggregation functions that are packaged by default, in addition to any user-provided aggregation function. Hopefully, the above information has captured your attention and helped you realize how important SQL windows are, based on the effort database vendors are making to add support for them.

    [... 3,780 words]

    data data engineering
  • What Happened to the API Economy?

    > TL;DR: We have an API Economy today, but it's not exactly what people were expecting 10 years ago when it started. The good news is we're starting a new phase of innovation that will change things up. Let's talk about why and how. At the start of the 2010s, the Cloud was transforming everything. SaaS was everywhere. CRMs like Salesforce, Ticketing Systems like Zendesk, and e-commerce solutions like Shopify were all transforming every aspect of the economy.

    [... 2,240 words]

    data api developer
  • Trino Internals - Parameterized Timestamp Types.

    In this post I want to take you through a journey that starts with GitHub Issue #1284 requesting support for nanosecond/microsecond precision in TIMESTAMP for Trino and ends with the merge of GitHub PR #3783 that added support for the parametric TIMESTAMP type to the Trino query engine. This journey includes a number of surprises too!

    [... 2,401 words]

    data databases systems
  • How Snowflake Pricing Works.

    Snowflake pricing is a bit complicated; this is a result of *usage-based pricing* together with their evolution into a *serverless database*. Let’s start with the fundamentals. The pricing landing page is an excellent starting point for understanding the Snowflake pricing model and its complexity. Pricing is affected by the following factors:

    [... 832 words]

    data pricing snowflake
  • About Data Observability.

    Observability is something that is used by engineers in the context of building reliable systems. Observability is not a new term; you can find a great article about observability on Wikipedia. What is important to keep is the following definition: *Observability is a measure of how well internal states of a system can be inferred from knowledge of its external outputs.*

    [... 1,386 words]

    data Ingestion data engineering
  • Starting and Growing a Podcast Show.

    Almost two years and more than 80 episodes later on the Data Stack Show, I have a completely different understanding of what it takes to be a host of a podcast and why someone would want to become one. This post is a reflection on all the things I've learned and feel I should share.

    [... 4,009 words]

    data podcast content
  • Data Ingestion Standards.

    Benn argues in his post for the importance of standardization as a tool to organize the messiness of the data ecosystem. One of the areas where standards have become important, in many different and not very obvious ways, is data ingestion. By standardization in data ingestion, we mainly refer to a number of frameworks that help us build connectors that either extract data from or load data to different systems. Benn mentions the messiness of the data ecosystem as the main reason for seeking standardization, but I’ll argue that there are a couple of different reasons that these data...

    [... 2,048 words]

    data Ingestion data engineering