About:

Jeroen Janssens is a data science expert and instructor passionate about teaching and building ML solutions.

Website:

Specializations:

Interests:

Data science Machine learning Visualizing data Teaching

Incoming Links:

Outgoing Links:

Evan Miller
Subscribe to RSS:
Jeroen Janssens announces his new role as Head of Developer Relations at Posit PBC, emphasizing the establishment of a voluntary DevRel Guild to support and amplify the developer community's engagement in activities like talks, bl...
Jeroen Janssens presents a cheatsheet for Plotnine, a Python package for data visualization, summarizing essential information for creating plots and figures. The cheatsheet is designed to be a quick reference guide, emphasizing t...
Jeroen Janssens reflects on his time at Xomnia, highlighting personal achievements and experiences during his two years at the company. He expresses gratitude for the opportunities and the people he worked with, and looks forward ...
The article discusses the challenges of visualizing high-dimensional data and the use of dimensionality reduction algorithms, specifically UMAP, to gain insight into the structure of the data. The author applies UMAP to the MNIST ...
The text is a blog article about the data visualization package Plotnine, which is based on the grammar of graphics and is similar to ggplot2. The author, Jeroen Janssens, discusses the advantages of plotnine over other data visua...
Jeroen Janssens and Thijs Nieuwdorp are writing a book titled 'Python Polars: The Definitive Guide' which is expected to be about 400 pages and to hit the shelves in Q3 2024. The book will cover Polars, a highly performant DataFra...
The article discusses the revival of a playground for stem-and-leaf plots made by the author 10 years ago. It explains the purpose of the playground and how it updates as values are changed. The example data comes from John Tukey’...
The text is an archive of an online course called Embrace the Command Line, created by Jeroen Janssens. The course was designed to help developers and researchers get started with the command line, and was based on Janssens' book ...
The blog article discusses how the author set up key mappings in iTerm to improve the iTerm+tmux experience. The author wrote a Python script to generate the corresponding JSON programmatically to avoid defining the key mappings m...
Jeroen Janssens is selling 16 domains, including datascienceworkshops.com, as he has closed his company Data Science Workshops and no longer needs them. He hopes to find a new owner who can put them to good use.
Jeroen Janssens announces the closure of his company Data Science Workshops B.V. after nearly seven years of teaching data science topics. He reflects on the rewarding experiences and challenges of running his own training company...
The text is about Jeroen Janssens building a Lego table for his kids and himself. He describes the process and highlights of the project, including using plywood, creating insets, and spray painting the table.
The article discusses how to scrape multiple pages in R and Rvest using the rvest package and purrr package. It provides examples of scraping Stack Overflow questions tagged R, Lego Star Wars sets, and titles from Hacker News. The...
The article discusses the heuristics for translating ggplot2 code to plotnine code. It explains that while plotnine code is different from ggplot2 code due to Python and R having different syntax and mechanics, 95% of ggplot2 code...
The article discusses the plotnine data visualization package for Python, which is based on the grammar of graphics. The author compares plotnine to other popular data visualization packages for Python and discusses its functional...
The article discusses a Python script called csv2vw created by Jeroen Janssens, which converts CSV data to Vowpall Wabbit’s input format. The script is available on GitHub in the dsutils repository and includes examples of its usa...
The text is an interview with Jeroen Janssens, a data scientist, about his background, work at YPlan, and his book and toolbox projects. He discusses his interest in data, his work at YPlan, and the content of his book. He also ta...
The article discusses the potential of IBash Notebook as a convenient environment for doing data science. It explores the idea of publishing a book as a collection of notebooks and the challenges associated with it. It also highli...
The article discusses the importance of the Unix command-line for data scientists and describes the author's book 'Data Science at the Command Line'. It also provides details on creating a data science toolbox environment using Va...
The text is a blog article by Jeroen Janssens about the Stochastic Outlier Selection (SOS) algorithm, which is an unsupervised outlier-selection algorithm that takes as input either a feature matrix or a dissimilarity matrix and o...
The article discusses seven command-line tools for data science, including jq, json2csv, csvkit, scrape, xml2json, sample, and Rio. The author explains how each tool can be used and provides examples. The article emphasizes the po...
The article discusses the importance of headline testing in digital media and how the Visual Revenue platform provides tools to optimize front page headlines. It explains the challenges of headline testing and compares the frequen...