Methodology — Government-statistics grounding

24-Country Reference Database

Each country's personas come from
that country's official statistics

Income distribution, profession-level wages, household composition, consumption patterns, and core regulations — measured data pulled from each country's government and public statistics offices is the ground truth for persona generation.

🇰🇷

Korea

KOSIS · Korea Customs Service

🇺🇸

United States

U.S. Census Bureau · BLS · Nielsen

🇯🇵

Japan

e-Stat · 総務省 · 厚生労働省

🇬🇧

United Kingdom

ONS · Family Resources Survey

🇩🇪

Germany

Destatis (Statistisches Bundesamt)

🇫🇷

France

INSEE

🇮🇹

Italy

ISTAT

🇪🇸

Spain

INE Spain

🇳🇱

Netherlands

CBS Netherlands

🇨🇦

Canada

Statistics Canada

🇲🇽

Mexico

INEGI

🇧🇷

Brazil

IBGE

🇦🇺

Australia

ABS (Australian Bureau of Statistics)

🇨🇳

China

NBS · 国家统计局

🇹🇼

Taiwan

DGBAS · 行政院主計總處

🇸🇬

Singapore

SingStat · HSA

🇲🇾

Malaysia

DOSM (Department of Statistics)

🇹🇭

Thailand

NSO Thailand · Thai FDA

🇻🇳

Vietnam

GSO Vietnam · Bộ Y Tế

🇮🇩

Indonesia

BPS Statistics Indonesia

🇵🇭

Philippines

PSA Philippines

🇮🇳

India

MoSPI · NSSO

🇦🇪

UAE

Federal Competitiveness Authority

🇸🇦

Saudi Arabia

GASTAT (General Authority for Statistics)

Refreshed annually via an automated GitHub Actions pipeline · Additional countries can be onboarded in 4–6 weeks on request

Why Not Just Ask ChatGPT?

What sets us apart from a generic AI chatbot

You'll get an answer either way. The question is whether the answer is something you can actually take into a boardroom.

Generic AI Chatbot (ChatGPT/Claude/Gemini)

"Will our product sell well in Vietnam?"

×
General knowledge from training cutoff. No specifics on Vietnam GSO 2024 household income distribution or Bộ Y Tế food registration procedures.
×
Reasoning isn't traceable — can't answer "why did you conclude that?"
×
One persona. No intent distribution, segment diversity, or minority-opinion visibility.
×
No quantitative outputs — no price curve, CAC estimate, or market prioritisation.
×
No executive PDF or charts — you have to assemble the deck yourself every time.

AI Market Twin200 personas grounded in government statistics · traceable sourcing
✓Live integration of 24-country official statistics + category-specific regulations (Bộ Y Tế · 厚生労働省 · Thai FDA · UK CPSR, etc.)
✓Every persona statement is traceable to its source data cell — sources listed in every PDF.
✓200-persona intent histograms, segment-level rejection factors, minority-champion visibility.
✓Price-vs-conversion curves, country-level CAC estimates, automatic HIGH/MEDIUM/LOW risk classification.
✓Executive PDF generated in one click (Korean and English).

Verifiable Accuracy

Measurable accuracy,
open scoring rubric

The most reliable way to reduce LLM hallucination is external grounding plus self-evaluation. AI Market Twin runs public-data anchors — universal ones for every origin, plus per-origin national-data providers — and a 5-metric self-scoring pipeline.

Universal grounding anchors — every origin

Hofstede 6-dimension cultural indices

28 countries × Power Distance · Individualism · Uncertainty Avoidance, etc. — calibrates persona decision priors

World Bank macro indicators

GDP per capita PPP · population · household consumption · internet penetration · urbanisation · inflation · Logistics Performance Index — grounding market size, e-commerce reach, retail density, price sensitivity, and import/distribution feasibility

UN Comtrade trade flows

Origin-dynamic export flows by HSCode — any of 24 reporter countries → partners, quantifying existing market interest per category (no longer Korea-only)

Import tariffs (WITS / TRAINS)

MFN applied duty each market levies on the product category (representative HS-6) — hard grounding for price competitiveness and entry cost (e.g. food into Vietnam 33.8% vs UK 0%)

Live per-country demand signal

Google search volume + recent-vs-prior trajectory (DataForSEO) · TikTok hashtag / creator-region · Baidu SERP presence for China — real-time organic demand per candidate market (live sims only; excluded from historical back-tests)

Governance indicators (WGI)

World Bank Worldwide Governance Indicators (0–100): regulatory quality · rule of law · control of corruption · political stability — grounds the regulatory & operational-risk of actually operating in each market

Diaspora affinity (estimate)

The origin country's overseas communities (web-grounded estimate) — a large origin diaspora predicts early-adopter demand and cultural affinity for origin-brand products

National-data providers — per home country

Origin-agnostic by design: each home country plugs in its national equivalents of listed-company financials and food/drug regulatory data. Korea, the United States, Japan, and the United Kingdom are wired today; more markets extend the same interface.

🇰🇷 Korea — DART · Korea Customs · KOTRA

DART consolidated financials + region-segment revenue · 관세청 (data.go.kr) monthly 10-digit HSCode exports · 86-country KOTRA registry of Korean entities abroad

🇺🇸 United States — SEC EDGAR · openFDA

SEC EDGAR companyfacts (XBRL) for listed-company financials · openFDA food / drug enforcement + cosmetics (CAERS) as the regulatory-risk anchor

🇯🇵 Japan — EDINET

EDINET 有価証券報告書 (annual securities reports) — net-sales / brand-level financials for Japanese listed companies

🇬🇧 United Kingdom — Companies House

Companies House registry — confirms a GB-origin brand's UK company exists and is active, with incorporation year (real-world footprint / longevity anchor)

Brand-specific GTM signals — the accuracy lever macro can't see

The back-test showed the true winning market is often decided by brand-level factors an existing footprint or a distribution deal (Shake Shack → UAE via a licensing group) that no macro anchor can detect. Two signals surface them; live-only, weighted into the country ranking.

Structured GTM intake (user-supplied)

You mark the markets where you already sell / have traction, hold a distribution / retail / licensing partner or LOI, or have a founder-team network. The ranker boosts those markets — a partner/LOI market has a proven route to market the data can't otherwise see.

GTM discovery (web-found)

Two Tavily passes: (1) existing footprint — where the brand already ships / is stocked; (2) per-market entry-partner ecosystem — importers / retail / licensing groups a foreign brand can enter through. An estimate, weighed qualitatively.

5-metric self-scoring pipeline

30%

top3Hit

Fraction of simulated top-3 markets that match the actual top-3 ground truth

25%

rankCorrelation

Spearman correlation between simulated market ranking and measured revenue ranking

20%

rejectRecall

Whether markets the brand actively avoided are also rejected by the sim (false-positive avoidance)

15%

confidenceCalibration

Whether STRONG / MODERATE / WEAK labels are calibrated against actual accuracy

10%

trendMatch

Whether predicted market-trend direction agrees with measured data

Confidence via dominance. A STRONG label is not a mere plurality — it requires the winning market to dominate: agreement across independent LLM providers and a clear vote-share margin over the runner-up. This calibration came from the N=19 back-test, where a plurality-only STRONG was over-confident (right ~55%). The rule makes STRONG mean "bet on this", MODERATE "shortlist it", WEAK "don't bet on this alone".

Honest measurement principles

The 5-metric scoring runs as open, auditable logic. Every code change is re-measured on the same product set (multiple K-product fixtures) automatically, and results are verified with paired t-tests for statistical significance. The measurement-improvement cycle runs weekly.

Every sim result lists the grounding anchors used and a per-metric score breakdown directly in the PDF. Accuracy improvements roll out to existing customers automatically — no separate upgrade cost per release.

Pipeline

The 6 stages every simulation runs through

From wizard input to PDF output in 5–7 minutes on average. Each stage uses a different LLM model tuned to its role.

01 — VALIDATING

Input validation + slot planning

Validates product info and pre-allocates 200 persona slots from category-specific profession pools.

02 — REGULATORY

Up-front regulatory check

Inspects sales bans and labelling rules per country and category. Markets where launch is structurally impossible are auto-excluded.

03 — PERSONAS

Pool sampling + voice generation

Reuses matching personas from the workspace pool; only new slots get fresh generation. Each persona's first-person voice quote is generated alongside.

04 — SCORING

Country-level prioritisation

Aggregates intent distribution, rejection factors, and trust signals across 200 personas to score each market on demand, CAC, and competitive intensity.

05 — PRICING

Price curve (3-sample median)

Three parallel pricing simulations with median selection eliminate single-call variance. Per-market price sensitivity broken out separately.

06 — RECOMMEND

Synthesis + self-critique

Vision model analyses any uploaded creative. Self-critique pass automatically verifies macro consistency (best-country alignment, etc.) before finalising the result.

What ChatGPT doesn't know,
AI Market Twin does

Each country's personas come from
that country's official statistics

What sets us apart from a generic AI chatbot

Generic AI Chatbot (ChatGPT/Claude/Gemini)

AI Market Twin

Measurable accuracy,
open scoring rubric

Universal grounding anchors — every origin

National-data providers — per home country

Brand-specific GTM signals — the accuracy lever macro can't see

5-metric self-scoring pipeline

The 6 stages every simulation runs through

Input validation + slot planning

Up-front regulatory check

Pool sampling + voice generation

Country-level prioritisation

Price curve (3-sample median)

Synthesis + self-critique

See an actual result — sample report

What ChatGPT doesn't know,AI Market Twin does

Each country's personas come fromthat country's official statistics

What sets us apart from a generic AI chatbot

Generic AI Chatbot (ChatGPT/Claude/Gemini)

AI Market Twin

Measurable accuracy,open scoring rubric

Universal grounding anchors — every origin

National-data providers — per home country

Brand-specific GTM signals — the accuracy lever macro can't see

5-metric self-scoring pipeline

The 6 stages every simulation runs through

Input validation + slot planning

Up-front regulatory check

Pool sampling + voice generation

Country-level prioritisation

Price curve (3-sample median)

Synthesis + self-critique

See an actual result — sample report

What ChatGPT doesn't know,
AI Market Twin does

Each country's personas come from
that country's official statistics

Measurable accuracy,
open scoring rubric