Understanding Data Pipelines - Kelly's Bytes - 11/12/24 (#109)


KELLY'S BYTES

By Kelly J. Adams

Hey!

You might’ve noticed that I’ve changed the format of my newsletter. I’ve been spending the last few weeks really thinking about how I want to structure my content from now on. So each newsletter contains:

  • Main Lesson: Something I’ve learned or am working on
  • 3 bite-sized resources: Still related to data or learning new skills

Main Lesson: SQL Functions

Understanding how data flows into tables has been a game-changer for my SQL workflow. Here’s why:

  • Conditional Filtering: CASE WHEN statements let me filter based on specific criteria directly in SQL, reducing extra data cleaning steps.
  • Focused Aggregation: I can aggregate data by distinctions like regular actions vs. high-value outcomes, without complex joins or additional processing.

Here’s an example query which counts player actions (like regular moves, bonuses, and high scores) within a date range, helping me spot patterns and anomalies:

For more on using CASE WHEN to streamline data filtering and support product decisions, check out this week’s blog post below!


Bite-Sized Resources

📺 Video: [Data Analytics] How I'd Learn to be a Data Analyst in 2024 from Luke Barousse on everything he’d do to become a data analyst that goes over which tools to learn, where to learn them and more.

🎧 Podcast: [Data Science] Beyond Borders: Elevating Women in Data Science and Leadership from the Women in Data Science Worldwide Podcast. Featuring Hannah Pham, Head of Data Science, Consumers at Pinterest, and her experience as an immigrant and the evolving nature of data science.

📝 Post: [Data Analytics] LinkedIn Post from Karina Samsonova with lessons she learned in her first year as a data analyst.


Courses & Resources

📚 ​Excel Course​: A free course that teaches you Excel from beginner concepts like formulas and functions to more advanced concepts like Power Query.

📚 ​SQL Course​: Learn basic SQL in 4 hours going over things like aggregation, common analysis techniques and more.

📚 ​Python Course​: Goes into not only using Python to analyze data but also visualize it as well with libraries like Matplotlib.

📊 How to Become a Data Analyst: Check out my blog post on how you can become a data analyst.

📊 Data Analytics Resources: View all the resources (e.g. podcasts, courses, etc.) I used to learn data analytics.

The links for my courses are affiliate links. If you decide to purchase the course through this link, I may receive a small commission at no extra cost to you. This support helps me continue to provide content. Thank you for your support!


Until next time,

- Kelly

Kelly's Bytes

Every other Friday, I share one practical lesson, highlight a blog post, and share 3 bite-sized resources to help you upskill and stay updated.

Read more from Kelly's Bytes

KELLY'S BYTES By Kelly J. Adams Hey! Thanks for subscribing and here’s this weeks edition lesson: Main Lesson: SQL User-Defined Functions One of the big improvement I’ve made in my SQL workflow recently is using User-Defined Functions (UDFs) in BigQuery. Here’s why: What it is: A UDF is a custom function you define to simplify repetitive logic in SQL queries. It can be written in SQL (my preference) or JavaScript. How it helps: Reduces redundancy, improves maintainability, and ensures...

KELLY'S BYTES By Kelly J. Adams Hey! Thanks for subscribing and here’s this weeks edition lesson: Main Lesson: T-Shaped Approach to Learning One of the biggest improvements I’ve made in my career planning is by adopting the T-shaped approach to skill-building. Here’s why it’s worth considering: What it is: A framework that balances deep expertise in one area (like data analytics) with broad knowledge across related areas (like business departments and industries). How it helps: It keeps your...

KELLY'S BYTES By Kelly J. Adams Hey! Thanks for subscribing and here’s this weeks edition lesson: Main Lesson: Using SQL for Better Data Analysis Recently, I’ve made improvements to my SQL Skills from SQL for Data Analysis by Cathy Tanimura. One lesson for me was the importance of using less common SQL functions, like CROSS JOINs, for more flexible analyses. Here’s why this matters: What it is: CROSS JOINs allow you to combine all rows from two datasets, which is especially useful for filling...