Published · 2026-05-15 · 9 min read
What works, what breaks, and what nobody talks about when you ship an AI chatbot that needs to speak Kurdish reliably alongside Arabic and English.
TL;DR: Multilingual AI chatbots are mostly a solved problem in 2026 — except for Kurdish. Here's what works, what breaks, and the four-layer pattern I use in production.
The state of Kurdish in modern LLMs
GPT-4o, Claude 3.5+, Gemini 2 Pro all handle Kurmanji and Sorani well enough for everyday business tasks. Where they break:
- Idiomatic phrases — they pattern-match Arabic or Persian.
- Mixed-script content — Sorani Arabic-script vs. Kurmanji Latin-script switching.
- Domain-specific terminology — hospitality, medical, legal.
- Tone — defaults to formal even when your business is casual.
The four-layer pattern
- Detection. Don't trust the user-supplied locale; detect script and language from the message itself. A user might type Sorani in Arabic letters, Kurmanji in Latin letters, or mix all three in one conversation.
- Normalization. Convert dialects, scripts, and numerals to a consistent internal representation before sending to the LLM.
- Prompt engineering with few-shot examples — show the model 4–6 example interactions in the dialect, tone, and terminology your business uses. This is where 80% of quality comes from.
- Output validation — check the model's response is actually in the requested locale (yes, models occasionally answer Sorani questions in Persian). Reject and retry if mismatched.
RTL and bidi text rendering — the boring part that breaks everything
Half the bugs in multilingual chatbots aren't AI bugs at all. They're bidirectional text rendering bugs. When a user sends "Order #1234 من فضلك" the rendering order is browser-dependent. Always test with mixed scripts in production browsers, not just on macOS Safari.
Cost and latency math
Multilingual LLM calls cost the same per token as English calls — but Arabic and Kurdish typically use 30–50% more tokens for the same meaning. Budget accordingly. Mitigation: use a smaller, cheaper model for triage and routing, and only escalate to the larger model when the question warrants it. This pattern saves 60–80% on token cost in production.
What I won't do
Train a Kurdish-specific model from scratch for an SMB. The cost is five figures minimum, the resulting quality is rarely better than prompt-engineered GPT-4o, and the maintenance burden is real. Stick to fine-tuning, prompt engineering, and good evaluations.
Related posts
More on the same theme — Kurdistan SMBs, AI, and the messy practical bits.
9 min read
AI agents vs chatbots: which does your Kurdistan business actually need?
A four-question decision tree, five Kurdistan cases I worked on this year, and the four mistakes I see most. Pick the chatbot first; graduate to an agent only when the math checks out.
9 min read
AI for Kurdistan e-commerce: Shopify, custom, or hybrid?
Eight decision questions to choose between Shopify, custom, and hybrid e-commerce builds in Kurdistan — and where AI fits in each path.
8 min read
AI for clinics and pharmacies in Kurdistan
Three high-leverage AI wins for Kurdistan healthcare — appointment booking, multilingual triage, pharmacy refill orchestration. With privacy considerations and real cost numbers.