About:

Matheus Pedroni is a data scientist and developer from Brazil interested in machine learning and data analysis.

Interests:

Machine learning, data analysis, software development

Outgoing Links:

Simon Willison
Subscribe to RSS:
The blog post discusses the author's experience building a Model Context Protocol (MCP) server that integrates linear regression capabilities into AI models. It explains what MCP is, its purpose in standardizing connections betwee...
The author discusses the implementation of an AI docstring generator for Python code using LLMs, explaining the reasons for reinventing the wheel and the approach taken to generate consistent and user-friendly docstrings.
This post is the third in a series about decision trees in the context of machine learning. It explores bagging, a popular strategy to reduce model variance, and random forests, an algorithm that uses decision trees and bagging. I...
This post is the second in a series about decision trees in the context of machine learning. It discusses the implementation of classification and regression trees (CART) in Python, using Python's standard library and NumPy and pa...
The text provides an introduction to decision trees in the context of machine learning, explaining their structure, types, algorithms, and objective functions. It discusses the limitations and advantages of decision trees, as well...
Regression to the mean (RTM) is a statistical tendency for extreme measurements to be followed by values closer to the mean. This concept is explained in depth, including examples and its implications in various fields such as med...
The text compares the implementation of lists in Elixir and Go, discussing the performance of different operations and the influence of programming language design on code. It also explains the time complexity of list operations a...
The text explains how to encrypt a computer using LUKS with a TPM and a password. It describes the motivation for encrypting the main disk and the steps to configure a LUKS partition to use a cryptographic key stored on the TPM chip comb...
The text provides a detailed guide on how to set up and use an Apache Spark cluster on a local network. It includes instructions on configuring the server, installing Apache Hadoop, installing and configuring Apache Spark, and con...
The text discusses the use of survival analysis with Cox regression to predict heart failure mortality in a public dataset. It explains the process of building survival curves and a predictive model based on survival probability. ...
The text discusses the importance of accurately predicting the risk of mortality in hospitals and healthcare services. It analyzes a dataset with common hospital-related variables and death outcome in patients with heart failure. ...
The text explains the effectiveness of a vaccine, clarifying misconceptions about it. It discusses the relative risk reduction and the probability of getting sick after vaccination. It also explains that the effectiveness is calcu...
The text discusses the use of p-values in scientific research and the misconceptions surrounding them. It explains the definition of p-values and what they are not, such as the probability that the results are due to chance or tha...
The text discusses the use of chloroquine as a treatment for COVID-19, highlighting the lack of strong evidence supporting its effectiveness. It explains the difficulty in interpreting medical evidence and the tendency to overesti...
Hydroxychloroquine was once considered a potential treatment for COVID-19, but multiple studies have found no evidence of its effectiveness. The miscommunication between the scientific community and the public has led to the wides...
The text explains the basics of outlier detection and removal, highlighting common mistakes. It discusses the reasons for removing outliers and the methods to detect them, such as Z-scores, IQR, and MAD. It also warns about the da...
The text uses recurrent neural networks to classify tweets from the Sentiment140 dataset as positive or negative. It discusses the pre-processing steps, splitting the data into train, test, and validation sets, and building...
The text discusses sentiment classification in NLP using the Sentiment140 dataset to train a classifier model in Python. It covers pre-processing, vectorizing, and interpreting the model's performance. The logistic regression mode...
The text provides an exploratory data analysis of the WHO suicide statistics dataset, covering various aspects such as gender, age, and country-specific trends. It highlights the need for caution in interpreting the data and empha...
The text uses PyTorch to analyze 3D fractals and generate Mandelbulbs. It also discusses the box-counting algorithm, fractal dimension, and lacunarity. The code for fractal analysis is presented in the t...
The text discusses the development of an algorithm to play Battleship as well as possible. The author explains the code they wrote for this problem and the different modes of the algorithm. They also discuss the issues and im...