Skip to content
SerdarDilshad

Published · 2026-05-15 · 9 min read

What works, what breaks, and what nobody talks about when you ship an AI chatbot that needs to speak Kurdish reliably alongside Arabic and English.

TL;DR: Multilingual AI chatbots are mostly a solved problem in 2026 — except for Kurdish. Here's what works, what breaks, and the four-layer pattern I use in production.

The state of Kurdish in modern LLMs

GPT-4o, Claude 3.5+, Gemini 2 Pro all handle Kurmanji and Sorani well enough for everyday business tasks. Where they break:

  • Idiomatic phrases — they pattern-match Arabic or Persian.
  • Mixed-script content — Sorani Arabic-script vs. Kurmanji Latin-script switching.
  • Domain-specific terminology — hospitality, medical, legal.
  • Tone — defaults to formal even when your business is casual.

The four-layer pattern

  1. Detection. Don't trust the user-supplied locale; detect script and language from the message itself. A user might type Sorani in Arabic letters, Kurmanji in Latin letters, or mix all three in one conversation.
  2. Normalization. Convert dialects, scripts, and numerals to a consistent internal representation before sending to the LLM.
  3. Prompt engineering with few-shot examples — show the model 4–6 example interactions in the dialect, tone, and terminology your business uses. This is where 80% of quality comes from.
  4. Output validation — check the model's response is actually in the requested locale (yes, models occasionally answer Sorani questions in Persian). Reject and retry if mismatched.

RTL and bidi text rendering — the boring part that breaks everything

Half the bugs in multilingual chatbots aren't AI bugs at all. They're bidirectional text rendering bugs. When a user sends "Order #1234 من فضلك" the rendering order is browser-dependent. Always test with mixed scripts in production browsers, not just on macOS Safari.

Cost and latency math

Multilingual LLM calls cost the same per token as English calls — but Arabic and Kurdish typically use 30–50% more tokens for the same meaning. Budget accordingly. Mitigation: use a smaller, cheaper model for triage and routing, and only escalate to the larger model when the question warrants it. This pattern saves 60–80% on token cost in production.

What I won't do

Train a Kurdish-specific model from scratch for an SMB. The cost is five figures minimum, the resulting quality is rarely better than prompt-engineered GPT-4o, and the maintenance burden is real. Stick to fine-tuning, prompt engineering, and good evaluations.

More on the same theme — Kurdistan SMBs, AI, and the messy practical bits.

SB

Serdar Dilshad

AI Automation Specialist & Software Engineer · Duhok, Kurdistan

About Serdar →

Available — Q3 2026

Free 15-minute scoping call. No obligation. We'll figure out together whether AI or automation actually helps your business — and where it doesn't.