
MENAValues: A Benchmark for Cultural Alignment and Multilingual Bias in LLMs


I Am Aligned, But With Whom? MENA Values Benchmark for Evaluating Cultural Alignment and Multilingual Bias in LLMs

Authors: Pardis Sadat Zahraei, Ehsaneddin Asgari


📜 Overview

MENAValues is a benchmark for evaluating how well large language models (LLMs) align with the beliefs and values of the Middle East and North Africa (MENA) region, and for surfacing multilingual biases in that alignment. As AI models are deployed globally, their tendency to reflect predominantly Western, English-speaking perspectives creates significant alignment gaps. This benchmark addresses the critical underrepresentation of MENA perspectives in AI evaluation by providing a scalable framework for diagnosing cultural misalignment and fostering more inclusive AI. This repository contains the dataset and evaluation code used in the paper.

Main Figure illustrating the benchmark's evaluation methodology

🔍 Key Features

  • Empirically Grounded Data: Features 864 questions derived from two large-scale, authoritative human surveys, the World Values Survey (WVS-7) and the Arab Opinion Index 2022 (an illustrative record layout is sketched after this list).
  • Broad Regional Coverage: Captures the sociocultural landscape of 16 MENA countries with population-level response distributions.
  • Multi-Dimensional Evaluation: Probes LLMs across a matrix of conditions, crossing three perspective framings with two language modes (English and native languages).
  • Deep Analytical Framework: Moves beyond surface-level responses to analyze token-level probabilities, revealing hidden biases and novel phenomena.
  • Four Core Dimensions: Questions are organized into Social & Cultural Identity, Economic Dimensions, Governance & Political Systems, and Individual Wellbeing & Development.
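
Each benchmark item pairs a survey question with its country context and the population-level response distribution it was derived from. As a rough illustration only (the field names below are hypothetical, not the repository's actual schema), a single record might look like this:

```python
# Hypothetical sketch of one MENAValues record; the real field names and
# structure are defined by the dataset files in this repository.
example_item = {
    "question_id": "WVS7-example",                  # illustrative identifier
    "source_survey": "WVS-7",                        # WVS-7 or Arab Opinion Index 2022
    "dimension": "Governance & Political Systems",   # one of the four core dimensions
    "country": "Morocco",                            # one of the 16 MENA countries
    "question": {
        "en": "<question text in English>",
        "ar": "<question text in Arabic>",
    },
    "options": ["<option 1>", "<option 2>", "<option 3>", "<option 4>"],
    # Population-level answer distribution from the underlying human survey,
    # used as the ground truth for alignment scoring.
    "human_distribution": [0.41, 0.33, 0.17, 0.09],
}
```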

Chart summarizing evaluation results across different models and metrics


📊 Key Findings

Our analysis reveals three critical and pervasive phenomena in state-of-the-art LLMs:

  1. Cross-Lingual Value Shift: Models provide drastically different answers to the same question when asked in English versus a native language (Arabic, Persian, or Turkish), indicating that cultural values are unstably encoded across languages.
  2. Reasoning-Induced Degradation: Prompting models to "think through" their answers by providing reasoning often degrades their cultural alignment, activating stereotypes or Western-centric logic instead of nuanced cultural knowledge.
  3. Logit Leakage: Models frequently issue surface-level refusals to sensitive questions (e.g., "I cannot answer that") while their internal logit probabilities reveal strong, high-confidence hidden preferences, suggesting safety training may only be masking underlying biases.
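
One common way to observe this effect is to compare a model's free-form reply with the probability it assigns to each answer option at the decision token. The sketch below uses Hugging Face transformers to read those option probabilities; it illustrates the general probing technique rather than the paper's exact code, and the model name, prompt, and option labels are placeholders.

```python
# Sketch of option-level probability probing ("logit leakage"): a surface
# refusal can coexist with a confident hidden preference over the options.
# Model name and prompt are placeholders, not the paper's exact setup.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "meta-llama/Llama-3.1-8B-Instruct"  # placeholder; any causal LM works
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16)
model.eval()

prompt = (
    "Question: <sensitive survey question>\n"
    "Options: (A) <option 1>  (B) <option 2>  (C) <option 3>  (D) <option 4>\n"
    "Answer with a single letter: "
)
inputs = tokenizer(prompt, return_tensors="pt")

with torch.no_grad():
    next_token_logits = model(**inputs).logits[0, -1]  # logits at the answer position

# Renormalize probability mass over the four option letters only.
option_ids = [tokenizer.encode(letter, add_special_tokens=False)[0] for letter in "ABCD"]
option_probs = torch.softmax(next_token_logits[option_ids], dim=-1)
for letter, prob in zip("ABCD", option_probs.tolist()):
    print(f"{letter}: {prob:.3f}")
```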

📈 Evaluation Framework

Our methodology systematically tests models under varying conditions to expose inconsistencies; a sketch of how these conditions combine into prompts appears after the lists below.

Perspectives (Framing Styles)

  • Neutral: The LLM is queried directly without any identity constraints.
  • Persona: The model is instructed to adopt a national identity (e.g., "Imagine you are an average Saudi...").
  • Observer: The model is asked to act as a cultural analyst (e.g., "How would an average Saudi respond...").

Languages

  • English: The dominant language of AI development.
  • Native Languages: Arabic, Persian, and Turkish; translations validated by native speakers.

Reasoning Conditions

  • Zero-Shot: The model provides a direct, immediate answer without additional reasoning prompts.
  • With-Reasoning: The model is prompted to provide a brief explanation before its answer.
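
Put together, every question is evaluated under each combination of framing, language, and reasoning condition. The sketch below shows one plausible way to enumerate those evaluation cells; the templates, country, and placeholder question text are illustrative and are not the benchmark's exact prompt wording.

```python
# Illustrative prompt builder crossing framing x language x reasoning condition.
# Templates are simplified placeholders, not the benchmark's exact wording.
from itertools import product

FRAMINGS = {
    "neutral": "{question}",
    "persona": "Imagine you are an average person from {country}. {question}",
    "observer": "How would an average person from {country} answer the following? {question}",
}
REASONING = {
    "zero_shot": "Answer with the option number only.",
    "with_reasoning": "Briefly explain your reasoning, then give the option number.",
}

def build_prompts(question_by_lang: dict, country: str):
    """Yield (framing, language, condition, prompt) for every evaluation cell."""
    for (framing, template), (lang, question), (condition, instruction) in product(
        FRAMINGS.items(), question_by_lang.items(), REASONING.items()
    ):
        body = template.format(question=question, country=country)
        yield framing, lang, condition, f"{body}\n{instruction}"

# Example: one question available in English and Arabic (placeholder text).
for cell in build_prompts(
    {"en": "<question in English>", "ar": "<question in Arabic>"}, country="Saudi Arabia"
):
    print(cell)
```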

Core Metrics

  • NVAS (Normalized Value Alignment Score): Measures alignment with ground-truth human survey values.
  • CLCS (Cross-Lingual Consistency Score): Measures response consistency between English and native-language prompts.
  • FCS (Framing Consistency Score): Measures response consistency across the Persona and Observer framings.
  • SPD (Self-Persona Deviation): Measures how much a model's response changes when it is assigned a persona.
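
The paper gives the formal definitions of these metrics. As a hedged illustration of the kind of computation involved, the sketch below implements one simple reading of CLCS as the fraction of items answered identically in English and the native language; this is an assumption for illustration, not the paper's formula.

```python
# Hedged illustration only: one simple reading of CLCS as agreement between
# English and native-language answers on shared items. The paper's actual
# metric definitions may differ.
def cross_lingual_consistency(responses_en: dict, responses_native: dict) -> float:
    """Fraction of shared question IDs answered identically in both languages."""
    shared = responses_en.keys() & responses_native.keys()
    if not shared:
        raise ValueError("No overlapping question IDs between the two response sets.")
    agreements = sum(responses_en[qid] == responses_native[qid] for qid in shared)
    return agreements / len(shared)

# Example with placeholder question IDs and option labels.
english_answers = {"Q1": "A", "Q2": "C", "Q3": "B"}
arabic_answers = {"Q1": "A", "Q2": "B", "Q3": "B"}
print(round(cross_lingual_consistency(english_answers, arabic_answers), 3))  # 0.667
```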
