About:

Richard Demsyn-Jones is a machine learning researcher and blogger interested in ML, engineering, and human behavior modeling.

Interests:

Machine learning, engineering, marketplaces, modeling human behavior
The post explores the evolution and techniques behind function calling in large language models (LLMs). It discusses how LLMs have improved in natural language tasks, code understanding, and tool usage through various methods such...
The blog post discusses the concept of agentic AI, which refers to systems that can set goals, plan, reason, and interact with the outside world autonomously. It contrasts agentic systems with traditional tools, highlighting the d...
Effective strategy is crucial for overcoming challenges, as highlighted by Richard Rumelt's insights in 'Good Strategy Bad Strategy', which the author interprets through personal experience.
Claude Code revolutionizes data projects by streamlining coding and analysis, though effective use requires domain knowledge and careful management of instructions.
The text discusses the inconsistency between online and offline machine learning models. It highlights the bugs and issues that arise due to the differences in the two settings, using DoorDash's example. It also suggests solutions...
The text discusses a variation of the Monty Hall problem called the golden goat variation. It explains the scenario and the optimal approach to solve the problem using Bayes' theorem. It also compares the original problem with the...
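The excerpt doesn't include the details of the golden goat variation, but the original Monty Hall problem it compares against can be checked empirically. A minimal simulation sketch of the classic game (the variation's rules are not reproduced here):

```python
import random

def play(switch: bool) -> bool:
    """Simulate one round of the classic Monty Hall game.

    The car is behind a random door; the player picks a door; the host
    then opens a door that holds neither the car nor the player's pick.
    Returns True if the player's final pick wins the car.
    """
    doors = [0, 1, 2]
    car = random.choice(doors)
    pick = random.choice(doors)
    # Host opens some door that is neither the pick nor the car.
    opened = next(d for d in doors if d != pick and d != car)
    if switch:
        # Switching means taking the one remaining unopened door.
        pick = next(d for d in doors if d != pick and d != opened)
    return pick == car

random.seed(0)
n = 100_000
stay_rate = sum(play(False) for _ in range(n)) / n
switch_rate = sum(play(True) for _ in range(n)) / n
print(f"stay: {stay_rate:.3f}, switch: {switch_rate:.3f}")
```

The simulation recovers the standard result that staying wins about 1/3 of the time and switching about 2/3, which is the baseline the post's Bayes'-theorem analysis of the variation starts from.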
The text discusses the debate on whether machines can think and create art. It explores the capabilities of machines and the concept of artificial intelligence. It also delves into the training of large language models (LLMs) and ...
The text discusses the importance of postmortems in software development, highlighting common anti-patterns and best practices. It emphasizes the need for accurate documentation, learning from mistakes, and improving systems. The ...
The text discusses the parallels between accumulating vendor relationships and accumulating internal technology, particularly in the context of building out a modern ML stack. It highlights the challenges of identifying the best c...
The text discusses the limitations of benchmark datasets for learning-to-rank (LTR) and the dominance of the Yahoo dataset in LTR literature. The author argues that the Yahoo dataset has key limitations and proposes the need for m...
The text discusses the importance of benchmark datasets in machine learning literature, particularly in the field of learning-to-rank (LTR). It highlights the impact of benchmark datasets on research, and provides detailed insight...
The text discusses the issue of position bias in features and its impact on machine learning algorithms. It explains how items that have historically been shown high up on lists will have received a lot of attention from users, an...
The text discusses the challenges of validating language models to ensure they do not generate inappropriate content. It describes the process of testing the language model's autocompletions and the efforts made to ensure it never...
The text discusses the concept of a 'model of everything' and its application in business. It explains the properties of such models, the challenges in building them, and the example of a system built at Lyft. The text also highli...
The text discusses inductive bias and expressiveness in machine learning models. It contrasts architectural decisions in machine learning and explains the structure of a model and the optimization algorithm. It also delves into in...
The text discusses Deep & Cross Networks (DCNs) and their application in machine learning models. It explains the concept of cross layers, the architecture of DCN-V2, and the advantages of using cross layers in neural networks. Th...
The text discusses three papers from the last few years in the learning-to-rank (LTR) literature, focusing on two-tower models for ranking problems. The papers contain models and evaluation methods that could be useful for those w...
The text discusses the rise of GELU as an activation function in large language model (LLM) architectures. The author explains the importance of activation functions in neural networks and how GELU became popular. The GELU paper is ...
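The excerpt doesn't reproduce the post's analysis, but GELU itself has a standard definition: GELU(x) = x · Φ(x), where Φ is the standard normal CDF, and the GELU paper also gives a common tanh-based approximation. A minimal sketch of both:

```python
import math

def gelu_exact(x: float) -> float:
    # GELU(x) = x * Phi(x), where Phi is the standard normal CDF,
    # written here via the error function: Phi(x) = 0.5 * (1 + erf(x / sqrt(2))).
    return x * 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def gelu_tanh(x: float) -> float:
    # The widely used tanh approximation from the GELU paper.
    return 0.5 * x * (1.0 + math.tanh(math.sqrt(2.0 / math.pi) * (x + 0.044715 * x**3)))

for x in (-2.0, -0.5, 0.0, 0.5, 2.0):
    print(f"x={x:+.1f}  exact={gelu_exact(x):+.4f}  tanh={gelu_tanh(x):+.4f}")
```

Unlike ReLU, GELU is smooth and non-zero for small negative inputs, which is part of why the exact and approximate forms are compared at all; the two agree to within about 1e-3 over typical input ranges.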
The text discusses the concept of profit-maximizing experimentation regimes, critiquing the use of a p < 0.05 criterion and suggesting that a maniacal adherence to minimizing experiment false positives is unproductive. It reviews a...
The text discusses the use of the p-value threshold of 0.05 in experiment analysis and decision-making. It argues that this threshold is arbitrary and may lead to false positives, emphasizing the need to consider false negatives a...
The term 'embedding' has become a central concept in machine learning. The author discusses the evolution of the term and its meaning, and how it has become hard to define. The text explores the properties and uses of embeddings, ...
The text discusses log loss and cross entropy in binary and multiclass problems. It explains the differences between log loss and cross entropy, and the author's concerns about cross entropy. The author also discusses the use of l...
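The excerpt doesn't state the post's specific concerns, but the relationship it discusses is standard: binary log loss is exactly the two-class special case of multiclass cross entropy. A minimal sketch demonstrating the equivalence:

```python
import math

def binary_log_loss(y: int, p: float) -> float:
    """Binary log loss for one example with true label y in {0, 1}
    and predicted probability p = P(y = 1)."""
    return -(y * math.log(p) + (1 - y) * math.log(1 - p))

def cross_entropy(true_dist, pred_dist) -> float:
    """Cross entropy -sum_k t_k * log(q_k) between a (one-hot) true
    distribution and a predicted distribution over K classes."""
    return -sum(t * math.log(q) for t, q in zip(true_dist, pred_dist) if t > 0)

# A binary problem written both ways gives the same number:
p = 0.8
a = binary_log_loss(1, p)                      # label y = 1, P(y=1) = 0.8
b = cross_entropy([0.0, 1.0], [1 - p, p])      # one-hot over classes {0, 1}
print(a, b)
```

Writing the binary case as a two-class distribution makes the correspondence explicit: the (1 - y) term of log loss is just the cross-entropy contribution of the "class 0" slot.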
The author discusses the importance of testing in product development and how a small change in the code review process led to a reduction in customer-facing bugs and experiment restarts. The change involved adding a 'Tested' labe...
The text discusses the problem of sharing constants across programming languages and repositories, and the bugs that can arise from inconsistent spelling of strings. It explores the use of Protobuf and Thrift to define constants a...