About:

Jordan Goodman is a data professional who publishes the Data Methods newsletter, helping readers learn about data and sharing his journey in the field.

Website:

Specializations:

Interests:

Data Data methods Learning about data Newsletter publishing
Subscribe to RSS:
The post outlines the evolution of Python DataFrame libraries, highlighting their unique features and the contexts in which they excel for data manipulation.
This project automates the extraction and API exposure of Texas Railroad Commission oil and gas permit data, simplifying access for analysis.
Plotly's Studio AI transforms business intelligence by enabling rapid dashboard creation through a prompt-based interface, merging low-code accessibility with full-code transparency.
The article compares two data analytics solutions: Snowflake and MotherDuck. Snowflake is a comprehensive data warehouse designed for large teams and complex data needs, offering features like materialized views and automated task...
The 2026 F1 regulations have led to varied driver adaptations, highlighting challenges in consistency rather than outright speed after the first qualifying session.
This analysis of MLB Statcast pitch data reveals consistent patterns in pitch behavior and highlights the importance of pitch location and contact quality over pitch selection.
The post discusses common SQL anti-patterns that can hinder query performance and maintainability, emphasizing the importance of proper coding practices. It highlights issues such as excessive use of CASE WHEN statements, improper...
DataSpec simplifies open-source data infrastructure deployment for smaller teams by using Docker containers to integrate essential components efficiently.
AI adoption should be driven by clear business objectives rather than trends, as pursuing it without purpose can lead to unnecessary complexity and costs.
The article discusses the challenges faced by analytics teams in large organizations, particularly the issue of report sprawl where multiple similar reports lead to confusion and inefficiency. It highlights the importance of estab...
Data engineers must grasp the underlying systems, like Linux, to troubleshoot effectively and manage resources beyond the abstractions of cloud services.
Executing a query in DuckDB involves efficient in-process steps that optimize performance and memory usage, making it fast and easy to work with.
DuckPond showcases how DuckDB can serve as a self-hosted query service for local data analysis, combining FastAPI and React for user-friendly SQL execution.
The blog post discusses Snowflake, a data warehousing tool that centralizes data from various sources, making it query-able with SQL. It highlights the benefits of Snowflake, such as cheap and scalable storage, independent compute...
Minicdc is a lightweight Python script designed for smaller analytics teams to facilitate data ingestion into data warehouses without the overhead of deploying complex ETL tools like Fivetran or Airbyte. It utilizes SQLite, DuckDB...
The article discusses the evolution of the modern data stack, particularly in light of Fivetran's merger with dbt Labs. It highlights the shift from a modular design to greater integration within the data ecosystem, while emphasiz...
DuckDB efficiently queries 100 parquet files with 5 billion rows in 15 minutes, showcasing its potential as a scalable analytics solution.
The blog post discusses Ducklake, an open-source metadata catalog extension for DuckDB, highlighting its ability to track schema and data changes in databases. The author provides a step-by-step guide on how to install DuckDB, set...
The article discusses the growing popularity of Polars in the Python data analysis community, highlighting its speed and performance advantages over Pandas, particularly for large datasets. However, it argues that for many analyst...
The blog post discusses the Python dependency management library 'uv', highlighting its advantages over pip, particularly in managing multiple Python versions and dependencies seamlessly. The author shares personal experiences and...
The post discusses the challenges data professionals face when migrating SQL queries between databases, specifically from SQL Server to Snowflake. It emphasizes the inefficiencies of traditional editing methods and advocates for u...
The author defends the relevance of the R programming language in data science, countering the narrative that it is dying in favor of Python. They highlight the contributions of Posit (formerly RStudio) in enhancing the R ecosyste...
The author, Jordan Goodman, shares his journey from graduating with a BA in Economics to working in logistics and eventually transitioning into data analytics and backend development. He highlights the challenges faced in logistic...
The article discusses the pitfalls of prioritizing status symbols in the tech industry, such as high compensation and prestigious company logos. It emphasizes the importance of actual skills and continuous learning over superficia...