Question 1

Why LangGraph over LangChain alone?

Accepted Answer

LangChain is excellent for composing primitives. For stateful, multi-step flows that must be observable, replayable and testable, LangGraph adds what's missing: typed state, deterministic transitions, durable checkpoints and first-class human-in-the-loop nodes. We still use LangChain components inside LangGraph nodes where they make sense; the two aren't mutually exclusive. For simple linear chains, LangChain alone is the right call.
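To make "typed state" and "deterministic transitions" concrete, here is a minimal sketch in plain Python. It deliberately does not use the LangGraph API; the names `TicketState`, `classify`, `draft`, `human_review` and `next_node` are hypothetical and exist only for illustration. The idea it shows: each node returns a new typed state, and the next node is a pure function of the current node and that state, so the same input always follows the same path.

```python
from dataclasses import dataclass, replace
from typing import Callable, Dict, List, Optional, Tuple


@dataclass(frozen=True)
class TicketState:
    """Typed, immutable state passed between nodes (hypothetical schema)."""
    text: str
    intent: Optional[str] = None
    needs_review: bool = False
    reply: Optional[str] = None


def classify(state: TicketState) -> TicketState:
    # Stand-in for a model call: a keyword rule keeps the sketch deterministic.
    intent = "refund" if "refund" in state.text.lower() else "general"
    return replace(state, intent=intent, needs_review=(intent == "refund"))


def draft(state: TicketState) -> TicketState:
    return replace(state, reply=f"Draft reply for {state.intent} request")


def human_review(state: TicketState) -> TicketState:
    # Placeholder: a real flow would pause here until a reviewer responds.
    return replace(state, reply=f"[approved] {state.reply}")


NODES: Dict[str, Callable[[TicketState], TicketState]] = {
    "classify": classify,
    "draft": draft,
    "human_review": human_review,
}


def next_node(current: str, state: TicketState) -> Optional[str]:
    """Transition table: depends only on the current node and the state."""
    if current == "classify":
        return "draft"
    if current == "draft":
        return "human_review" if state.needs_review else None
    return None


def run(state: TicketState, start: str = "classify") -> Tuple[TicketState, List[str]]:
    """Execute nodes until no transition remains; return final state and path."""
    path: List[str] = []
    node: Optional[str] = start
    while node is not None:
        state = NODES[node](state)
        path.append(node)
        node = next_node(node, state)
    return state, path
```

Because the path is recorded alongside the state, a test can assert on the route taken as well as the result, which is what makes this style of flow straightforward to test.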

Question 2

How do you version-control LangGraph state machines?

Accepted Answer

The graph lives in git like any other code. Prompts are versioned files tied to the eval run that approved them. State schemas are typed (Pydantic or Zod) and reviewed alongside the graph. Every production deploy is from a tagged commit; the prompt registry tracks which prompt hash ran for which trace. No more 'someone edited it in the UI on Friday'.
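One way to picture the "which prompt hash ran for which trace" bookkeeping is the sketch below. It is an illustrative assumption, not the registry described above: `PromptRegistry`, `TraceRecord`, the 12-character hash length and the in-memory storage are all hypothetical. It hashes prompt text with SHA-256, records which eval run approved each hash, and refuses to log a trace for a prompt that was never approved.

```python
import hashlib
from dataclasses import dataclass
from typing import Dict, List, Tuple


def prompt_hash(prompt_text: str) -> str:
    """Short content hash of a prompt; any edit to the text changes it."""
    return hashlib.sha256(prompt_text.encode("utf-8")).hexdigest()[:12]


@dataclass(frozen=True)
class TraceRecord:
    trace_id: str
    git_tag: str
    prompt_name: str
    prompt_hash: str
    eval_run_id: str


class PromptRegistry:
    """In-memory stand-in for a prompt registry (hypothetical design)."""

    def __init__(self) -> None:
        # (prompt name, hash) -> id of the eval run that approved it
        self._approved: Dict[Tuple[str, str], str] = {}
        self.traces: List[TraceRecord] = []

    def approve(self, name: str, text: str, eval_run_id: str) -> str:
        digest = prompt_hash(text)
        self._approved[(name, digest)] = eval_run_id
        return digest

    def record_trace(self, trace_id: str, git_tag: str, name: str, text: str) -> TraceRecord:
        digest = prompt_hash(text)
        eval_run_id = self._approved.get((name, digest))
        if eval_run_id is None:
            # The prompt text differs from every approved version.
            raise ValueError(f"prompt {name!r} with hash {digest} was never approved")
        record = TraceRecord(trace_id, git_tag, name, digest, eval_run_id)
        self.traces.append(record)
        return record
```

With this shape, an untracked edit to a prompt surfaces as a rejected trace rather than a silent behaviour change.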

Question 3

Do you ship eval harnesses?

Accepted Answer

Always. Every LangGraph engagement ships with a versioned golden dataset (50–500 examples per intent), a scoring rubric mixing deterministic and model-graded checks, a regression suite that blocks merges on agreed thresholds, and a model A/B harness covering Claude Sonnet 4.6, Opus 4.7, Haiku 4.5, GPT-4o, GPT-4o-mini and DeepSeek-V3. LangSmith is the default surface; we'll integrate with whatever your team already uses.
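As a rough illustration of a regression suite that blocks merges on agreed thresholds, the sketch below scores a golden dataset per intent and reports which intents fall below their minimum pass rate. It covers only the deterministic (exact-match) half of the rubric; model-graded checks are omitted. The names `Example`, `pass_rate_by_intent` and `gate`, and the threshold values in the usage note, are assumptions for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class Example:
    """One golden-dataset row (hypothetical schema)."""
    intent: str
    input_text: str
    expected_label: str


def pass_rate_by_intent(
    examples: List[Example], predict: Callable[[str], str]
) -> Dict[str, float]:
    """Exact-match pass rate for each intent in the golden dataset."""
    totals: Dict[str, int] = {}
    passes: Dict[str, int] = {}
    for ex in examples:
        totals[ex.intent] = totals.get(ex.intent, 0) + 1
        if predict(ex.input_text) == ex.expected_label:
            passes[ex.intent] = passes.get(ex.intent, 0) + 1
    return {intent: passes.get(intent, 0) / totals[intent] for intent in totals}


def gate(rates: Dict[str, float], thresholds: Dict[str, float]) -> List[str]:
    """Return the intents below threshold; an empty list means the merge may proceed.

    An intent with a threshold but no examples counts as failing.
    """
    return sorted(
        intent
        for intent, minimum in thresholds.items()
        if rates.get(intent, 0.0) < minimum
    )
```

In CI, a wrapper script would call `gate(...)` and exit non-zero when the returned list is non-empty, for example with thresholds such as `{"refund": 0.9, "billing": 0.9}` agreed in advance.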

Question 4

What about checkpoint-and-resume in production?

Accepted Answer

LangGraph's checkpointer (Postgres or Redis-backed in our deployments) persists state at every node transition. Mid-flow failures resume from the last good checkpoint rather than restarting. Human-in-the-loop pauses are durable — a reviewer can respond hours later and the run resumes cleanly. We treat replayability as a first-class architectural concern, not a nice-to-have.
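The resume behaviour can be sketched without any framework. The example below is an assumption-laden stand-in, not LangGraph's checkpointer API: it uses the standard-library `sqlite3` module in place of Postgres or Redis, and the names `SqliteCheckpointer` and `run_flow` are hypothetical. State is saved after every step, so a rerun with the same run id skips the steps that already completed.

```python
import json
import sqlite3
from typing import Callable, Dict, List, Optional, Tuple

State = Dict[str, object]
Step = Callable[[State], State]


class SqliteCheckpointer:
    """Stores the latest state and next step index per run (sketch only)."""

    def __init__(self, conn: sqlite3.Connection) -> None:
        self.conn = conn
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS checkpoints ("
            "run_id TEXT PRIMARY KEY, next_step INTEGER NOT NULL, state TEXT NOT NULL)"
        )

    def save(self, run_id: str, next_step: int, state: State) -> None:
        self.conn.execute(
            "INSERT OR REPLACE INTO checkpoints VALUES (?, ?, ?)",
            (run_id, next_step, json.dumps(state)),
        )
        self.conn.commit()

    def load(self, run_id: str) -> Optional[Tuple[int, State]]:
        row = self.conn.execute(
            "SELECT next_step, state FROM checkpoints WHERE run_id = ?", (run_id,)
        ).fetchone()
        return (row[0], json.loads(row[1])) if row else None


def run_flow(run_id: str, steps: List[Step], initial: State, cp: SqliteCheckpointer) -> State:
    """Run steps in order, resuming from the last saved checkpoint if one exists."""
    saved = cp.load(run_id)
    index, state = saved if saved is not None else (0, dict(initial))
    while index < len(steps):
        state = steps[index](state)  # an exception here leaves the last checkpoint intact
        index += 1
        cp.save(run_id, index, state)
    return state
```

A human-in-the-loop pause fits the same pattern: the flow stops after saving a checkpoint, and the reviewer's response triggers another `run_flow` call with the same run id. State must be JSON-serialisable in this sketch.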

Question 5

Can we host it ourselves?

Accepted Answer

Yes. LangGraph deploys as a Python or JS/TS service in your cloud account (AWS, GCP or Azure, in AU regions where required). State persists to your Postgres; vector memory runs on Pinecone, Weaviate or pgvector, your choice. LangSmith is the standard observability layer; we'll wire OpenTelemetry to Datadog or Sentry alongside it.

Question 6

What's the engagement model?

Accepted Answer

Implementation starts at $20K and typically runs 4–10 weeks, scoped against a measurable revenue or cost line. A retainer from $3K per month covers ops, eval runs, model upgrades, drift detection, dashboards and a monthly architecture review. Source escrow is available; you own the code, the prompts, the evals and the infrastructure.

Production LangGraph, Built Properly

When You Need a LangGraph Consultant

Your prompts have become a state machine in disguise

You need checkpoint-and-resume in production

Human-in-the-loop is load-bearing

Multi-agent orchestration without the spaghetti

Evals in CI on every prompt and model change

Observability that matches a production service

What a Shakan LangGraph Engagement Looks Like

Scoping

Architecture

Build

Eval

Deploy

Retainer

What We Pair With LangGraph

Common LangGraph Use Cases at This Price Tier

Voice AI for an AU healthcare practice

Multi-agent content engine for B2B SaaS

Revenue-ops scoring across HubSpot + Stripe + product

AUSTRAC AML triage for a mortgage broker

Support triage for eCommerce

Operations agent for a professional-services firm

Pricing context

Technical Questions