Pr/multilingual by raghavm243512 · Pull Request #121 · ServiceNow/eva

raghavm243512 · 2026-05-18T21:35:49Z

initial multilingual version

Easily extendable to many language using the add_culture_data script. This will do translation, gender consistent naming, suggest names, extend data, etc. So if anyone wants to run a language not committed in EVA data, it is trivially easy to do so
Readme section showing basic of adding a language.
Tested end to end once, but I don't have 11lab login and ran out of credits on free one (needed a custom agent for foreign lang)

Still TODO:
Currencies
Actually committing the translations (didn't want to burn credits until finalized)
Analysis
Testing a large variety of models to ensure they actually get the language code they expected (es-MX vs es, for example)

katstankiewicz

can you also add ensure_ascii=False to AuditLog save()

katstankiewicz · 2026-05-21T16:01:30Z

+def get_initial_message(language: str) -> str:
+    """Return the assistant's opening line for ``language``.
+
+    Falls back to English. Raises if even English is missing (data quality).


Suggested change

Falls back to English. Raises if even English is missing (data quality).

Falls back to English. Raises if even English is missing.

katstankiewicz · 2026-05-21T16:07:13Z

+    user_message = resolve_user_goal(
+        record.user_goal,
+        record.culture_overrides,
+        os.getenv("EVA_LANGUAGE", "en"),


Suggested change

os.getenv("EVA_LANGUAGE", "en"),

language,

katstankiewicz · 2026-05-21T16:07:37Z

+    goal = resolve_user_goal(
+        record.user_goal,
+        record.culture_overrides,
+        os.getenv("EVA_LANGUAGE", "en"),


Suggested change

os.getenv("EVA_LANGUAGE", "en"),

language,

and add language to build_user_sim_prompt

katstankiewicz · 2026-05-21T16:31:09Z

                ),
                audit_log=audit_log,
                api_key=params["api_key"],
+                base_url=params.get("url", ""),


Suggested change

base_url=params.get("url", ""),

we don't need a url for openai services

katstankiewicz · 2026-05-21T16:51:43Z

+from eva.utils.culture import FIRST_NAME_PLACEHOLDER, LAST_NAME_PLACEHOLDER
+from eva.utils.json_utils import extract_and_load_json
+from eva.utils.llm_client import LLMClient
+from eva.utils.logging import get_logger


Suggested change

from eva.utils.logging import get_logger

from eva.utils.logging import get_logger, setup_logging

katstankiewicz · 2026-05-21T16:52:05Z

+from eva.utils.logging import get_logger
+from eva.utils.router import init
+
+logger = get_logger(__name__)


adds logging (previously was only printing warning level or above)

Suggested change

logger = get_logger(__name__)

setup_logging()

logger = get_logger(__name__)

katstankiewicz · 2026-05-21T16:54:24Z

+
+    # 3. Write back atomically.
+    if dry_run:
+        logger.info(f"[dry-run] would update {dataset_path} ({len(records)} records)")


Suggested change

logger.info(f"[dry-run] would update {dataset_path} ({len(records)} records)")

logger.info(f"[dry-run] would update {dataset_path} ({len(target_ids)} records)")

katstankiewicz · 2026-05-21T17:03:54Z

+def _load_names_file(path: Path) -> dict[str, list[str]]:
+    data = json.loads(path.read_text(encoding="utf-8"))
+    for key in ("male_first", "female_first", "last"):
+        if not isinstance(data.get(key), list) or not data[key]:


should this check that there are enough names as well? ie BUCKET_SIZE?

raghavm243512 force-pushed the pr/multilingual branch from bd0e0d9 to 81923ab Compare May 19, 2026 18:39

raghavm243512 added 3 commits May 19, 2026 14:21

initial multilang impl

a4bcb4d

test fix

8ac2aaa

date formats

f5a5b52

raghavm243512 force-pushed the pr/multilingual branch from 68fb05a to f5a5b52 Compare May 19, 2026 21:28

translations and supporting stuff

ada9699

raghavm243512 force-pushed the pr/multilingual branch from 606dc7d to ada9699 Compare May 20, 2026 23:38

Apply pre-commit

aa152bd

katstankiewicz reviewed May 21, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pr/multilingual#121

Pr/multilingual#121
raghavm243512 wants to merge 5 commits into
mainfrom
pr/multilingual

raghavm243512 commented May 18, 2026 •

edited

Loading

Uh oh!

katstankiewicz left a comment

Uh oh!

katstankiewicz May 21, 2026

Uh oh!

katstankiewicz May 21, 2026

Uh oh!

katstankiewicz May 21, 2026

Uh oh!

katstankiewicz May 21, 2026

Uh oh!

katstankiewicz May 21, 2026

Uh oh!

katstankiewicz May 21, 2026

Uh oh!

katstankiewicz May 21, 2026

Uh oh!

katstankiewicz May 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	Falls back to English. Raises if even English is missing (data quality).
	Falls back to English. Raises if even English is missing.

	from eva.utils.logging import get_logger
	from eva.utils.logging import get_logger, setup_logging

	logger = get_logger(__name__)
	setup_logging()
	logger = get_logger(__name__)

	logger.info(f"[dry-run] would update {dataset_path} ({len(records)} records)")
	logger.info(f"[dry-run] would update {dataset_path} ({len(target_ids)} records)")

Conversation

raghavm243512 commented May 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

katstankiewicz left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

raghavm243512 commented May 18, 2026 •

edited

Loading