AI agents & chatbots

Custom AI that actually answers — not vague chat that escalates everything.

Most chatbots fail because they're stitched together from a script and a vague prompt. We build agents grounded in your real documents, evaluated against your real questions, with the guardrails that stop them embarrassing you in front of a customer.

The problem

The chatbot you tried last year was a glorified contact form.

Most off-the-shelf chatbots do one of two things badly. They either pattern-match on keywords and dump the user into a phone-tree of pre-written replies, or they're a thin wrapper around a generic LLM that confidently invents shipping policies you don't have.

The shape of a useful AI agent is different. It's grounded in your actual content — help centre, policy docs, past tickets, product spec, internal wiki. It cites where its answers come from. It refuses when it doesn't know. It escalates to a human, with the full transcript, the moment a query crosses a threshold you've defined. And it improves measurably over time because someone is running an eval suite against it every week.

Building that takes more than wiring an API key into Intercom. It takes retrieval engineering, prompt engineering, eval discipline, and a conversation about what the agent is — and isn't — allowed to do.
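In code, the cite-refuse-escalate behaviour described above reduces to a small decision layer sitting in front of the model. A minimal sketch — function names, the score threshold, and the escalation topic list are all illustrative, not a production implementation:

```python
from dataclasses import dataclass

@dataclass
class Retrieval:
    passage: str   # text fetched from the help centre, wiki, etc.
    source: str    # where it came from, so the answer can cite it
    score: float   # retriever relevance score, 0..1

def decide(query: str, hits: list[Retrieval],
           min_score: float = 0.55,
           escalate_topics: tuple[str, ...] = ("pricing", "contract", "refund")) -> str:
    """Route a query: answer with citations, refuse, or hand off to a human."""
    # Contractual and pricing questions always go to a human, with the transcript.
    if any(topic in query.lower() for topic in escalate_topics):
        return "escalate"
    # Nothing sufficiently relevant was retrieved: refuse rather than invent.
    if not hits or max(h.score for h in hits) < min_score:
        return "refuse"
    # Otherwise the model answers using only `hits`, citing each source.
    return "answer"
```

The point of keeping this logic outside the model is that it's testable: you can assert the agent escalates pricing questions and refuses ungrounded ones without calling an API at all.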

Approach

How we build agents that ship

The standard 4-week proof-of-value: define quality, ground it in your data, wire guardrails, deploy with measurement. Extend or stop based on the numbers — not on a demo that wowed someone in a meeting.

  1. Define what "good" looks like

    Before we touch a model, we agree on the eval set: 30–100 real questions the agent must answer well, and the criteria for what "well" means. Without this, you can't tell if an upgrade actually improves anything.

  2. Ground it in your real data

    Most agent failures are retrieval failures, not model failures. We build a RAG pipeline from your actual sources — Notion, Confluence, Help Scout, Intercom, Zendesk, PDFs, transcripts — with chunking, re-ranking, and citation that lets users verify what the agent says.

  3. Wire in guardrails and escalation

    Topic filters so it stays in lane. Refusal patterns for things it shouldn't answer. PII redaction in logs. Hand-off rules to a human (with full transcript) when confidence is low or the user asks. Pricing and contractual questions get escalated by default.

  4. Ship, evaluate, iterate

    We deploy behind a feature flag, A/B against your existing flow if there is one, and run the eval set weekly. You see exactly which questions the agent improved on, which it regressed on, and where the next investment goes.
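The weekly eval step above is, at its core, a diff between two scored runs. A minimal sketch, assuming each question in the eval set gets a 0–1 score from a judge model or a tool like Promptfoo (the threshold is illustrative):

```python
def eval_diff(last_week: dict[str, float], this_week: dict[str, float],
              threshold: float = 0.05) -> dict[str, list[str]]:
    """Compare two eval runs keyed by question ID; scores are 0..1."""
    report: dict[str, list[str]] = {"improved": [], "regressed": [], "unchanged": []}
    for qid, new_score in this_week.items():
        old_score = last_week.get(qid, 0.0)  # new questions count as improvements
        if new_score - old_score > threshold:
            report["improved"].append(qid)
        elif old_score - new_score > threshold:
            report["regressed"].append(qid)
        else:
            report["unchanged"].append(qid)
    return report
```

The "regressed" bucket is the one that matters: it's what stops a prompt tweak or model upgrade that demos well from silently breaking questions you'd already won.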

Stack

We pick the model and the surrounding tooling per use case. Streaming web chat needs different infrastructure from overnight document processing, and voice agents need different infrastructure from both.

  • OpenAI
    GPT-4 / GPT-5 / o-series
  • Anthropic
    Claude 4.x family
  • LangChain
    orchestration
  • LlamaIndex
    RAG pipelines
  • Pinecone
    vector store
  • pgvector
    Postgres-native
  • Vercel AI SDK
    streaming UIs
  • Voiceflow
    no-code agents
  • Twilio / Vapi
    voice agents
  • Helicone / Langfuse
    observability
  • Ragas / Promptfoo
    evals
  • Custom code
    Python / TypeScript
Real-world

What this looks like in practice

Figures are typical ranges from comparable engagements.

  • 67%
    Tier-1 ticket deflection

    Customer support agent for a UK fintech

    A 90-person fintech was answering 1,400 support tickets a week, two-thirds of which were variations on the same 30 questions about KYC, statements, and limits.

    We built a Claude-powered support agent grounded in the help centre and policy docs, with hard escalation rules for anything involving money movement. Two months in, it resolves 67% of incoming tickets with no human in the loop and a 4.6/5 satisfaction score.

  • 12 → 1
    Hours per RFP response

    Internal knowledge agent for a US consultancy

    Senior consultants were spending 12 hours per week digging through past projects, case studies, and proposals to assemble RFP responses.

    An internal Claude agent indexed 8 years of past proposals, case studies, and SOWs. It now drafts the first version of every RFP response in under a minute, with citations to the source documents. Consultants spend 1–2 hours editing instead of 12 hours building from scratch.

  • 24/7
    Voice agent coverage

    Inbound voice agent for a UK trades business

    An emergency-plumbing business was missing 30% of after-hours calls — and losing the jobs to competitors who answered.

    A voice agent (Vapi + GPT-4) now handles after-hours calls: triages urgency, captures address and issue, books a slot in the scheduling system, and pages the on-call engineer if it's a true emergency. After-hours conversion rate went from 22% to 71%.

Frequently asked questions

Will it hallucinate?
Not if it's built right. The standard pattern that hallucinates is "give the model a question and hope." Our pattern is retrieval-augmented: the model can only answer using passages we've fetched from your real documents, and every answer cites the source. It will refuse questions it doesn't have grounded data for — which is the correct behaviour, even if it feels unfamiliar.
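A minimal sketch of that retrieval-augmented pattern, assuming the passages have already been fetched from the vector store (the prompt wording is illustrative):

```python
def build_grounded_prompt(question: str, passages: list[tuple[str, str]]) -> str:
    """Build a prompt that restricts the model to retrieved passages.

    `passages` is a list of (source_id, text) pairs from the vector store.
    """
    context = "\n\n".join(f"[{src}] {text}" for src, text in passages)
    return (
        "Answer ONLY from the sources below. Cite the [source id] for every claim. "
        "If the sources don't contain the answer, say you don't know.\n\n"
        f"Sources:\n{context}\n\n"
        f"Question: {question}"
    )
```

Because every claim carries a source ID, the UI can render each citation as a link back to the original document, and the eval suite can check that cited sources actually support the answer.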
OpenAI or Claude — which should we use?
Both are excellent and the gap is smaller than vendor marketing suggests. Claude tends to do better on long-document reasoning, careful instruction-following, and cases where you want it to refuse or hand off. OpenAI often wins on agentic tool use, multimodal (vision), and the o-series for complex reasoning. We frequently ship with both behind a single API and route per query type. We have no reseller relationship with either.
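Once a classifier has labelled the incoming query, the per-type routing can be as small as a dispatch table. The mapping below is illustrative — in practice it gets tuned against the eval set, not vendor marketing:

```python
def pick_model(query_type: str) -> str:
    """Route a classified query to the provider that tends to handle it best."""
    routes = {
        "long_document": "claude",  # long-context reasoning, careful refusals
        "tool_use": "gpt",          # agentic tool calling
        "vision": "gpt",            # multimodal input
    }
    # Default to the model that refuses and hands off most reliably.
    return routes.get(query_type, "claude")
```

Keeping the routing in one place also means a model upgrade is a one-line change you can A/B, rather than a rewrite.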
What happens to our private data?
By default, both OpenAI's API and Anthropic's API contractually do not train on your data. Your documents stay in your vector store (we usually self-host pgvector inside your infrastructure for sensitive cases). For regulated industries we can run open models — Llama, Mistral — on your own infrastructure with no third-party API calls at all.
Can we run it on-prem or in our own cloud?
Yes. Frontier models (GPT, Claude) call out to API endpoints — but everything around them (the vector store, the orchestration, the logging, the UI) runs in your infrastructure. For air-gapped deployments we use open-weight models (Llama 3.x, Mistral, Qwen) deployed on your own GPU instances.
How do you handle GDPR and SOC 2 requirements?
Standard practice on every engagement: PII redaction before any data goes to a model, regional API endpoints (EU for UK/EU clients), a Data Processing Agreement on your terms, and full audit logging of every query and response. We can produce the documentation auditors need — we've been through SOC 2 reviews on the customer side.
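The redaction step can be sketched with regular expressions for the common cases; a real deployment layers an NER-based redactor on top, and the patterns below are illustrative:

```python
import re

# Placeholder patterns for the two most common PII types in support traffic.
PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+"),
    "PHONE": re.compile(r"\+?\d[\d\s().-]{8,}\d"),
}

def redact(text: str) -> str:
    """Replace emails and phone numbers with labelled placeholders
    before the text is logged or sent to a model API."""
    for label, pattern in PATTERNS.items():
        text = pattern.sub(f"[{label}]", text)
    return text
```

Redacting before the API call — not just before logging — is what keeps PII out of third-party systems entirely, which is usually the question auditors actually ask.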
What's the smallest agent project you'll take?
A 2–3 week proof-of-value: one well-scoped use case, one source corpus, one channel (web widget, Slack, or internal tool). Usually £8k–£15k / $10k–$18k. If it works we extend; if it doesn't we tell you why and stop.

See what an agent grounded in your data could do.

Send us 10–20 questions a customer or colleague typically asks, and we'll build a working prototype against your real documents in a week — flat fee, refunded if you don't see a path to value.