00:00
All righty. Hi everyone.
My name is Isaac and I'm a core contributor to DSPy. I want to start off with a quick thank-you to Jeff and the Chroma team for setting up this awesome event, and thank you to
00:16
Ramp for hosting us here. Today I'll be talking about using DSPy for context engineering.
Let's get right to it. So why are we talking about context engineering?
So, this term is fairly new, but really context engineering's
00:31
popularity is a symptom of the problem that it's trying to solve, which is that as a field, we're still learning how to build reliable AI software. Here's a tweet from Toby, the CEO of Shopify, where he defines context
00:46
engineering as the art of providing all the context for the task to be plausibly solved by the LLM. I'm sure you're all familiar with this tweet.
A follow-up question to that: what are the actual challenges you need to solve when you're doing context engineering?
01:03
Let's look at this through an example: extracting information from an email. For this example, you're writing an internal productivity app to digest email threads per employee.
So this app needs to do two things. The first thing it needs to
01:19
do is collect all the important events from whatever email you're passing in. Then you want to extract and prioritize all the relevant action items from this email.
And so given this information, our app could do a number of things. It could propose focus blocks.
It could propose meetings in
01:38
order to follow up on these action items. There are a number of possibilities.
So if we were to build this task, what does it actually entail? Well, we have three input fields, right?
The subject, email thread, and attachments. These are fairly arbitrary fields that I selected.
And if
01:55
you look at it, subject, well, that's probably a string. Email thread, well, that's probably also a string.
But email data is messy, as anyone who has worked with it knows. And it can get really long.
How do we actually know what information we want to include from this email
02:11
thread in our program? And then lastly, we have attachments, which for this example we'll say are just images, but they could be PowerPoints, they could be Word documents.
They could be a number of things. So, we need to take this
02:26
information. We need to pass it to our LLM.
And then on the other end, we need to get events and action items out. And events and action items, at this point, we haven't actually specified what they are or what they need to contain.
Our task is wide open. And so at
02:43
a high level, the system doesn't really sound complicated yet, but it brings up a lot of questions. You need to pass the LLM some of these inputs, like subject, email thread, and attachments.
But these could be in a lot of messy formats, and you
02:59
need to pass them in consistently so that the LLM will respond fairly consistently across different data inputs. So when you're writing a prompt, you have to ask yourself: do I want to use XML?
Do I want to use JSON? Do I want to use some
03:16
other format? How does the model actually know what is what?
And then you also ask: how do I include images? How do I make sure the model knows which image or attachment is being referenced at each point, right?
This becomes a prompting nightmare in
03:33
terms of just tagging everything. And so you need some way to structure your inputs.
Then you might ask, okay, I know that I need to give this to the model. How do I actually get it to do the task correctly?
Do you need to add some
03:50
reasoning field? Do you need to add tools?
How does that pollute your context window? What are the downstream implications of doing any of these?
When you're editing a string prompt and you're begging the model, please add a reasoning field in between thinking
04:05
tokens, you actually need to be able to measure what will happen downstream. And you want to be able to flexibly change between different strategies because a strategy that's good this week might not be good next week, right?
These change week to week.
04:22
And then one important part is that you might ask, okay, I've given the LLM everything it needs to know. What do I actually want it to output?
Which is a hard question until you dive deep into what your actual problem is and what the requirements are for your specific user.
04:38
So you then have to beg the model to actually output in the format that you want. It could be JSON, it could be XML.
It changes from model to model, from provider to provider, and with model size, right? These things are different, and these models are only
04:55
successful in outputting these schemas at different scales and with proper examples. So now we've added structure to our outputs.
We know what we want the model to say, but it's still not working.
05:11
So what is the LLM actually doing when I pass in an example? And what is it that I'm telling it to do?
Okay, let's say it's hallucinating events. Well, now I need to add another line to my already growing prompt that contains my inputs.
05:26
It contains whatever my inference strategy is. It contains my output format.
It contains some ideas on what the model should do for this output format. And now you're begging it not to hallucinate too.
But maybe, in the inverse, it then adds too few events. So you need to
05:43
change to go the other direction. So that brings me to the point of my talk, which is that prompts are bad ways to build good software.
All of your design choices end up getting mushed together into one single giant
05:58
brittle string, and when you want to change anything, it breaks. This is just not robust to different LLMs.
Your exact word choice and what examples you use matter for a given LLM. What inference
06:13
strategy you want to use. Do you want to use chain of thought?
Do you want to use ReAct? Do you want to use reflection?
I don't know. It depends on whatever your actual problem is.
And you need to try different options easily in order to see what actually works to solve your problem. Then there's also formatting
06:31
your data, your input instructions, and your output formatting. All of these different engineering design decisions get tangled into one big long string.
So, DSPy asks the question: what if
06:46
building AI systems was more of an engineering discipline, dare I say context engineering? And now we get to DSPy, which is a framework for programming, not prompting, language models.
And here's a tweet again from Toby, the CEO of Shopify, saying that DSPy is his context
07:01
engineering tool of choice, which is an awesome endorsement for the framework. So, let's go over some of the core components and how they actually help you do context engineering.
We'll start off with signatures. Signatures turn LLM calls into structured function definitions.
07:17
Let's go back to our email example from before. We have our inputs: subject, email thread, and attachments.
We have some task specification, which at this point still isn't clear. We're going to pass those into an LLM.
And then we have our outputs event and
07:34
action items. So how would you represent this inside DSPy while avoiding this big long prompt string?
Well, signatures are just class definitions. It starts with a docstring, which is just a specification of the actual task that you're trying to complete.
Then you
07:51
have input fields, which can be explicitly typed or can just be natural-language strings. In this example, subject and thread are implicitly just strings.
But then attachments is actually just a dictionary from the file name to an image. Right?
It's that easy to specify
08:07
what your input format is and it's consistent across every input that that uses this program. Then we have our outputs which are a list of events and events can just be Python objects.
They can be Pydantic objects. They can be anything.
And
08:23
action items, which are guaranteed to be a string and map to one of three literal options. And here there's some DSPy magic, because all you have to do is declare the types in your signature, and DSPy will make sure that whatever the output is at the end
08:38
matches this type, or it will throw an error. Signatures are great because they delegate your inference strategies, your learning, and your language-model plumbing to other parts of the framework.
This just represents your event extraction, right? This is the
08:55
task the language model is doing, which is different from an inference strategy. So the second concept is a module.
A module lets you swap and apply inference strategies to your actual signatures. So in order to try a different inference
09:11
strategy, let's say you want to try ReAct instead of chain of thought, you don't actually have to change a string prompt.
So we'll start off with just chain of thought. Chain of thought is not new, but notably, all you need to do to add chain of thought is wrap your
09:27
signature, ExtractEvents, in dspy.ChainOfThought. Now, what's nice about this example is that your concerns are separated.
The reasoning field is not part of your task and it shouldn't be in the signature. Right?
In order to extract events, you don't need a reasoning field. You're saying that in
09:43
order to have an LLM be good at this task, I need a reasoning field. Then, what if we think tools will help?
What we can do is convert our signature into an interactive ReAct program, where we're giving the model access to an OCR tool
09:58
and a search tool. Here, ReAct will let the model reason and then take some action, repeatedly, until it ends with the outputs for our signature.
But you'll notice all we need to do is provide our signature and provide our tools which are Python functions to the model.
10:15
Then lastly, we have Refine, which will iteratively try different attempts in order to return the one that scores best according to some reward function. What is this reward function?
It's any Python function that you want. It can be an LLM judge.
It can be a concrete rubric.
10:32
The third concept is an optimizer, which tunes your AI system's prompts or even weights. Here we have an example using one of DSPy's most popular optimizers, MIPROv2.
Here we have our judge as a metric which just takes in a subject, a thread,
10:48
predicted events, and returns whether it's accurate or not. And then we have our optimizer, which will take in this program and try different combinations of few-shot examples and prompts in order to maximize this metric on our
11:04
dataset, which is just some list of email threads that we've provided. And one of the cool things about DSPy is that the community is building awesome integrations.
So here, it's only a few lines to set up GRPO fine-tuning in order to maximize our
11:19
program's performance according to this metric. This is awesome work from Noah Ziems, who's a community member.
So what does DSPy actually do? DSPy handles all the plumbing, the scaling, and the learning, so that you get to
11:37
focus on the system design, natural language specification and evaluations. And together, these make up all of what you need to do context engineering.
And because of this design, you're separated from any single brittle string
11:52
prompt. And that means that you can actually invest in defining the things that are specific to your problem and the system that you are trying to build.
In conclusion, DSPy has everything that you need in order to build reliable AI
12:08
software and to free you from battling string prompts, so you can actually do the important context engineering work. Lastly, DSPy is made possible by an awesome open-source community, which you should all join and be a part of if you aren't already.
Check us
12:25
out at dspy.ai. Follow us on Twitter, and I'll be around afterwards, so feel free to reach out.
Thank you.