03 · AI Engineering

One question. Three AIs. Instantly.

Ask a question — and see how Claude, GPT and a local model answer in parallel. This same choice is what we build into your systems.

Anthropic Cloud

claude-haiku-4-5

A CRM bundles customers, leads and service tickets in one place instead of scattering them across emails and spreadsheets. That saves time, prevents duplicate work, and makes revenue, pipeline and service quality measurable.

Sample answer 0.84 s ~ €0.0001
OpenAI Cloud

gpt-4o-mini

With a CRM your sales and service teams always have the full customer context — sales opportunities slip less often, and you can see, with data, where to invest next.

Sample answer 0.71 s ~ €0.0001
Llama 3.1 Local

llama-3.1-8b · Groq

A CRM holds all customer contacts, orders and open items in one place. Every department can see what is going on, instead of gathering info from ten spreadsheets.

Sample answer 0.33 s €0.00

What we do with AI

Two disciplines. Four use cases.

AI integration

Build AI into your systems.

Copilots in Dynamics, AI assistants in your software, automated classification, summarization, extraction. Right in the workflow, not in a separate chat window.

  • Copilot Studio
  • Custom Agents
  • RAG

AI interfaces

Connecting different AIs.

Cloud LLMs, local models, mixed setups. A provider-agnostic layer that keeps switching open — and the data you want to keep, local.

  • Anthropic
  • OpenAI
  • Azure OpenAI
  • Llama · Mistral

Four concrete use cases

Document analysis

Contracts, reports, specifications — evaluated structurally.

CRM enrichment

Structuring notes, classifying email, summarizing cases.

Coding accelerator

AI in our build process — faster, more precise, documented.

Knowledge search

RAG over internal documents — precise answers, with sources.

Data sovereignty

When data cannot leave the building.

Local LLMs on your hardware or in your EU cloud. GDPR-clean, low latency, no dependency on US providers. You keep your data — and our consulting tells you which models really fit which use case.

  • Llama 3.x
  • Mistral
  • Qwen
  • Ollama · vLLM
// Provider-agnostic
const ai = new SoHo.AI({
  primary:   'claude-haiku',
  fallback:  'gpt-4o-mini',
  sensitive: 'local-llama',
});
// Switch providers
// without changing code.

Let's talk about your AI use case.

30 minutes. Honest assessment of whether and where AI really makes sense for you.