Blog

illustrations illustrations illustrations illustrations illustrations illustrations
Hallucinations in LLMs: Mitigating Factors and Mechanistic Interpretation

Hallucinations in LLMs: Mitigating Factors and Mechanistic Interpretation

Hallucinations in Large Language Models

Introduction

Large Language Models (LLMs) have shown remarkable capabilities in generating human-like text, but they are prone to producing “hallucinations” - confident assertions that are factually incorrect or nonsensical. This post explores the phenomenon of hallucinations in LLMs, examining their underlying causes, mitigation strategies, and attempts at mechanistic interpretation.

Read More
LLM Poisoning: Attacks and Defenses Against Large Language Models

LLM Poisoning: Attacks and Defenses Against Large Language Models

LLM Poisoning: Attacks and Defenses

Introduction

Large Language Models are vulnerable to various forms of poisoning attacks that can compromise their integrity, reliability, and safety. Understanding these attack vectors and developing robust defenses is crucial for maintaining trust in AI systems.

Read More
LLM SEO: Optimizing Content for Large Language Models

LLM SEO: Optimizing Content for Large Language Models

LLM SEO: Optimizing Content for Large Language Models

Introduction

As Large Language Models become central to information retrieval and content generation, a new form of Search Engine Optimization (SEO) has emerged. LLM SEO focuses on optimizing content to be effectively indexed, ranked, and retrieved by AI systems, rather than traditional search engines.

Read More
LLM Steering: Controlling Model Behavior and Outputs

LLM Steering: Controlling Model Behavior and Outputs

LLM Steering: Controlling Model Behavior

Introduction

As Large Language Models become more powerful, the ability to reliably steer their behavior becomes increasingly important. LLM steering encompasses various techniques to guide model outputs toward desired outcomes while maintaining coherence and usefulness.

Read More
Prompt Trajectory: Human vs Machine Generated Prompts

Prompt Trajectory: Human vs Machine Generated Prompts

Prompt Trajectory: Human vs Machine Generated Prompts

Introduction

The way prompts are crafted and processed by Large Language Models reveals fundamental differences between human and machine-generated content. Understanding prompt trajectories - how prompts evolve and are interpreted through the AI system - provides insights into human-AI interaction patterns and optimization opportunities.

Read More
Scaling Factors and Emergent Behavior in Large Language Models

Scaling Factors and Emergent Behavior in Large Language Models

Scaling Factors and Emergent Behavior in LLMs

Introduction

The dramatic improvements in Large Language Model performance have largely been driven by scaling - increasing model size, training data, and computational resources. This scaling has led to the emergence of unexpected capabilities, challenging our understanding of intelligence and learning.

Read More
How to finetune a model then deploy it to the web: Part 1 - 'Training by finetuning'

How to finetune a model then deploy it to the web: Part 1 - 'Training by finetuning'

Overview

One problem with getting started with ML nowadays is how daunting some of the ‘interesting’ things can be. The models are large, often too large for regular computers. This is a two-part blog on how one might work with a larger model in a manageable way. I’m on a Macbook Pro M2 with 16gb of RAM, which admittedly is quite powerful, but this method should be repeatable on most hardware. This should be able to serve as somewhat of an introduction to ML. This will be especially helpful if you have prior programming experience, but have yet to apply itto ML.

Read More
Machine Learning (Usually) isn't the Right Solution

Machine Learning (Usually) isn't the Right Solution

This is a weird one. As someone who has made their living from machine learning this probably isn’t what you’d expect. But really, most problems don’t need to use machine learning! I’ll help you come up an intuition to understand the types of problems machine learning tends to solve well. This applies equally if you are a developer or not, there is no required technical know-how.

Read More