AI, ML & data analytics

What's included

A few kinds of AI & data work we take on.

We focus on the boring-but-important parts: making sure the answers are right, the cost is sensible, and the thing keeps working when it's been live for a month.

LLM features in your product

Chat, summarisation, drafting, classification — added to your existing product without breaking what already works.

RAG & retrieval apps

Answering questions over your own data, with the citations, chunking, and guardrails needed to keep the answers honest.

Data pipelines & analytics

Getting your data from wherever it lives into a place where it's useful — and then keeping it clean as the schema drifts.

Internal AI tools

Tools for your own team — analyst copilots, draft generators, classifiers — that save real time on real work.

Evaluations & guardrails

Eval harnesses, regression tests, and safety checks so you actually know whether a prompt change made things better or worse.

Classification & extraction

Pulling structured data out of unstructured input — emails, PDFs, transcripts — with measurable accuracy you can defend.

Every engagement also includes

An honest baseline — measured before we touch anything
Cost and latency budgets, with monitoring so you'd notice if they break
An evaluation harness for your prompts and outputs
Fallback paths for when the model is wrong, slow, or unavailable
Documentation of the trade-offs we made and why

How we work

Short cycles, plain communication.

We start small. Most AI projects fail not because the model is wrong, but because nobody checked the answers against real use.

01
Understand
What's the job to be done, who's the user, and what does "good enough" look like? We get specific before writing code.
02
Prototype
A working prototype with real data, so you can see how it performs before committing to a bigger build.
03
Harden
Evaluation, cost controls, fallbacks, and monitoring — the parts that decide whether it survives in production.
04
Hand over
We document the prompts, the evals, and the trade-offs — so your team can iterate without us.

Tech we use

Boring, proven tools — used well.

We pick tools we trust to be maintainable for the long haul. If you already have a stack, we'll usually meet you there.

Languages

PythonTypeScript

Models & APIs

AnthropicOpenAIOpen-source LLMs

Data

Postgrespgvectordbt

Tooling

LangChainLlamaIndexStreamlit

AI that actually works in production.

A few kinds of AI & data work we take on.

LLM features in your product

RAG & retrieval apps

Data pipelines & analytics

Internal AI tools

Evaluations & guardrails

Classification & extraction

Every engagement also includes

Short cycles, plain communication.

Understand

Prototype

Harden

Hand over

Boring, proven tools — used well.

Got an AI idea you'd like a sanity check on? Let's talk.