fix(inference): fail-fast on empty model for cloud providers, demote nvidia-nim Sentry noise by staimoorulhassan · Pull Request #2811 · tinyhumansai/openhuman

staimoorulhassan · 2026-05-28T04:49:40Z

Summary

Closes #2784.

make_cloud_provider_by_slug now rejects a zero-length effective model at factory construction time instead of silently forwarding model="" to the upstream API. Providers like nvidia-nim respond with {"error":{"message":"model field is required",...}} (400), and each agent turn generates a Sentry event (TAURI-RUST-4NM).

Root cause: In make_cloud_provider_by_slug, when the provider string is nvidia-nim: (empty model component) and the [[cloud_providers]] entry has no default_model, effective_model resolved to "". Nothing validated this before forwarding it in the API request body.

Changes:

factory.rs: After resolving effective_model, bail with a clear, actionable message if it's still empty (and the entry is not OpenhumanJwt — JWT entries route to the backend and never forward the model).
config_rejection.rs: Add "model field is required" to the PHRASES list as defense-in-depth.
factory_test.rs: Two new unit tests:
- cloud_provider_with_no_model_and_no_default_rejected — empty provider string + no default_model → clear error naming the slug
- cloud_provider_default_model_used_when_model_part_is_empty — empty provider string + has default_model → factory succeeds, resolves to default_model
config_rejection.rs tests: detects_nvidia_nim_missing_model_body pins the classifier against the real wire body.

Test plan

cargo test -p openhuman -- inference::provider::factory_test passes (two new tests + existing suite unchanged)
cargo test -p openhuman -- inference::provider::config_rejection::tests passes
Manually configure a [[cloud_providers]] entry with slug = "nvidia-nim" and no default_model; attempting to use nvidia-nim: shows the actionable error instead of a provider 400
Same entry with default_model = "meta/llama-3.1-8b-instruct" and provider string nvidia-nim: → factory uses the default model

Summary by CodeRabbit

Bug Fixes
- Improved detection and messaging for cases where a provider model is missing or empty, including clearer error text to help identify misconfigured providers.
- Broadened recognition of common “missing model” phrasing to better classify such errors.
Tests
- Added unit and regression tests to verify handling and classification of missing/empty provider model scenarios.

…nvidia-nim Sentry noise Fixes tinyhumansai#2784. `make_cloud_provider_by_slug` now rejects a zero-length effective model (provider string `nvidia-nim:` with no inline model AND no `default_model` entry) at factory construction time instead of silently forwarding `model=""` to the upstream API. Providers such as nvidia-nim respond with `{"error":{"message":"model field is required",...}}` (400) and Sentry accumulates one event per agent turn (TAURI-RUST-4NM). Changes: - `factory.rs`: bail with a clear, actionable message when `effective_model.trim().is_empty()` after resolution. Actionable path: either supply the model in the provider string (`nvidia-nim:<model-id>`) or set `default_model` in the `[[cloud_providers]]` config entry. - `config_rejection.rs`: add "model field is required" to the `PHRASES` list as defense-in-depth so any body that still reaches the HTTP layer is demoted to `log::info!` rather than Sentry. - `factory_test.rs`: two new unit tests — `cloud_provider_with_no_model_and_no_default_rejected` asserts the early-error path fires with a message naming the slug, and `cloud_provider_default_model_used_when_model_part_is_empty` asserts the happy-path fallback to `default_model` is preserved. - `config_rejection.rs` tests: `detects_nvidia_nim_missing_model_body` pins the "model field is required" classifier.

coderabbitai · 2026-05-28T04:49:52Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

📝 Walkthrough

Walkthrough

Adds detection for nvidia-nim "model field is required" error bodies and tightens factory handling/error messaging for cloud provider strings with empty model parts; includes unit tests for both detection and factory regression cases.

Changes

Cloud Provider Model Validation and Error Handling

Layer / File(s)	Summary
Config rejection classifier expansion for missing model field `src/openhuman/inference/provider/config_rejection.rs`	`is_provider_config_rejection_message` now matches the case-insensitive phrase `"model field is required"` and shortens an Ollama anchor to `"requires a subscription"`. Unit test `detects_nvidia_nim_missing_model_body` verifies matching both a full nvidia-nim 400 body and the bare phrase.
Factory validation for empty effective_model `src/openhuman/inference/provider/factory_test.rs`, `src/openhuman/inference/provider/factory.rs`	`make_cloud_provider_by_slug` prepends a clearer `"no model configured"` prefix to the empty-effective-model error path. Regression tests assert that `<slug>:` is rejected when no `default_model` (error mentions "no model configured" and the slug) and accepted/resolved when `default_model` exists.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Possibly related PRs

tinyhumansai/openhuman#2796: Extends is_provider_config_rejection_message matcher and adds related tests, touching the same detection logic.
tinyhumansai/openhuman#2813: Modifies config_rejection.rs matcher lists and tests for provider 400-body patterns similar to this PR.
tinyhumansai/openhuman#2146: Related changes around make_cloud_provider_by_slug handling of empty/invalid model resolution and factory tests.

Suggested labels

sentry-traced-bug

Suggested reviewers

graycyrus
oxoxDev
M3gA-Mind

Poem

I munched a carrot, found a bug so small,
"model field is required" — I hopped to the call. 🐇
Tests now spot the empty slug at night,
Factory speaks clearly and sets things right.

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately summarizes the main changes: preventing empty model failures in cloud providers and reducing nvidia-nim Sentry noise by improved error handling and classification.
Linked Issues check	✅ Passed	The PR implementation addresses the core requirements from `#2784`: prevents empty model field in requests, adds detection phrase to reduce Sentry noise, and includes unit tests covering both rejection and default model scenarios.
Out of Scope Changes check	✅ Passed	All changes are directly scoped to fixing the nvidia-nim empty model issue in factory.rs, factory_test.rs, and config_rejection.rs with no unrelated modifications.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

Three lines were formatted in a way cargo fmt rejects: - config_rejection.rs: wrap bare assert! arg onto its own line - factory_test.rs: inline `let err = match` (no line-break before match) - factory_test.rs: inline `let (_, model) = call()` method chain

The empty-model bail! added in TAURI-RUST-4NM fired for openhuman: (auth_style=OpenhumanJwt) slugs, breaking openhuman_slug_routes_to_backend. OpenhumanJwt entries route to the OpenHuman backend and never forward the effective_model to an upstream API, so an empty model is valid for that auth style. Guard the check with auth_style != OpenhumanJwt.

graycyrus

@staimoorulhassan hey! the code looks good to me — solid fix for the nvidia-nim Sentry noise. the fail-fast in make_cloud_provider_by_slug is the right call, and the tests cover both cases cleanly. the config_rejection.rs defense-in-depth phrase is a nice touch too.

however, the Rust Core Coverage (cargo-llvm-cov) CI job is failing (the Run cargo llvm-cov for openhuman core step itself, not a coverage threshold — so this may be an infra flake, but it's blocking the coverage gate from running entirely). once that's green, i'll come back and approve this. let me know if you need any help!

staimoorulhassan · 2026-05-28T07:44:24Z

Hi @graycyrus, thanks for the thorough review! The Rust Core Coverage (cargo-llvm-cov) failure is a pre-existing flaky test completely unrelated to this PR's changes:

thread 'openhuman::approval::gate::tests::pending_for_thread_tracks_request_under_chat_context_and_clears'
panicked at src/openhuman/approval/gate.rs:687
assertion failed: matches!(handle.await.unwrap(), GateOutcome::Allow)

Our PR only touches factory.rs, factory_test.rs, and config_rejection.rs — no changes to approval/gate.rs or anything in the approval domain. The test / Rust Core Tests + Quality job (which runs the same suite) passed cleanly on this PR, so the failure is a timing/race condition in the gate test that fires intermittently under llvm-cov's instrumented build (single-threaded, slower). A re-run should clear it.

staimoorulhassan

_{Commented in CodeRabbit Change Stack}

staimoorulhassan · 2026-05-28T08:21:14Z

Hi @graycyrus — the Rust Core Coverage job failed again with the same pre-existing race in approval/gate.rs:687 (the pending_for_thread_tracks_request_under_chat_context_and_clears test). This is unrelated to any changes in this PR; it surfaces intermittently under llvm-cov's single-threaded instrumented mode. Could you please trigger a re-run of that job? As an external fork contributor I can't re-run CI on this repo. All other checks pass clean. Thanks!

staimoorulhassan

All Seems perfect now!

coderabbitai · 2026-05-28T22:46:41Z

Actionable comments posted: 0

… configured' to factory error

staimoorulhassan requested a review from a team May 28, 2026 04:49

coderabbitai Bot added rust-core Core Rust runtime in src/: CLI, core_server, shared infrastructure. bug labels May 28, 2026

coderabbitai Bot previously approved these changes May 28, 2026

View reviewed changes

staimoorulhassan dismissed coderabbitai[bot]’s stale review via 6168e1c May 28, 2026 04:59

coderabbitai Bot previously approved these changes May 28, 2026

View reviewed changes

staimoorulhassan dismissed coderabbitai[bot]’s stale review via 20a51ec May 28, 2026 06:36

coderabbitai Bot previously approved these changes May 28, 2026

View reviewed changes

oxoxDev self-assigned this May 28, 2026

graycyrus reviewed May 28, 2026

View reviewed changes

oxoxDev removed their assignment May 28, 2026

staimoorulhassan commented May 28, 2026

View reviewed changes

Merge branch 'main' into fix/nvidia-nim-model-field

745ed09

staimoorulhassan dismissed coderabbitai[bot]’s stale review via 745ed09 May 28, 2026 22:45

staimoorulhassan commented May 28, 2026

View reviewed changes

coderabbitai Bot previously approved these changes May 28, 2026

View reviewed changes

fix(inference): add missing phrases to config_rejection and 'no model…

eae48b0

… configured' to factory error

staimoorulhassan dismissed coderabbitai[bot]’s stale review via eae48b0 May 28, 2026 23:24

coderabbitai Bot added the sentry-traced-bug Bug identified via Sentry triage label May 28, 2026

coderabbitai Bot previously approved these changes May 28, 2026

View reviewed changes

Merge branch 'main' into fix/nvidia-nim-model-field

45e5bea

staimoorulhassan dismissed coderabbitai[bot]’s stale review via 45e5bea May 28, 2026 23:56

senamakel approved these changes May 29, 2026

View reviewed changes

senamakel merged commit dad5f42 into tinyhumansai:main May 29, 2026
30 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(inference): fail-fast on empty model for cloud providers, demote nvidia-nim Sentry noise#2811

fix(inference): fail-fast on empty model for cloud providers, demote nvidia-nim Sentry noise#2811
senamakel merged 6 commits into
tinyhumansai:mainfrom
staimoorulhassan:fix/nvidia-nim-model-field

staimoorulhassan commented May 28, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented May 28, 2026 •

edited

Loading

Reviews paused

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Suggested labels

Suggested reviewers

Poem

Uh oh!

graycyrus left a comment

Uh oh!

staimoorulhassan commented May 28, 2026

Uh oh!

staimoorulhassan left a comment

Uh oh!

staimoorulhassan commented May 28, 2026

Uh oh!

staimoorulhassan left a comment

Uh oh!

coderabbitai Bot commented May 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

staimoorulhassan commented May 28, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented May 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviews paused

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Suggested labels

Suggested reviewers

Poem

Uh oh!

graycyrus left a comment

Choose a reason for hiding this comment

Uh oh!

staimoorulhassan commented May 28, 2026

Uh oh!

staimoorulhassan left a comment

Choose a reason for hiding this comment

Uh oh!

staimoorulhassan commented May 28, 2026

Uh oh!

staimoorulhassan left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot commented May 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

staimoorulhassan commented May 28, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented May 28, 2026 •

edited

Loading