
fix: parse reasoning output from LM Studio reasoning models #5584

Open

angelplusultra wants to merge 3 commits into master from 5583-bug-lm-studio-provider-does-not-present-reasoning-output


Conversation

@angelplusultra (Contributor) commented May 6, 2026

Pull Request Type

  • ✨ feat (New feature)
  • 🐛 fix (Bug fix)
  • ♻️ refactor (Code refactoring without changing behavior)
  • 💄 style (UI style changes)
  • 🔨 chore (Build, CI, maintenance)
  • 📝 docs (Documentation updates)

Relevant Issues

resolves #5583

Description

The LM Studio provider was using the generic handleDefaultStreamResponseV2 stream handler, which only reads choices[0].delta.content. LM Studio's OpenAI-compatible endpoint emits reasoning tokens for reasoning models in a separate reasoning_content field (same shape as DeepSeek), so those tokens were silently dropped on both the streaming and non-streaming paths and the UI never showed the model's thinking output.
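
For reference, a DeepSeek-style stream interleaves the two fields like this (the values below are illustrative, not captured output):

```js
// Illustrative delta shapes from an OpenAI-compatible reasoning endpoint.
// A handler that only reads `delta.content` never sees the first chunk.
const reasoningChunk = {
  choices: [
    { delta: { reasoning_content: "First, restate the problem..." }, finish_reason: null },
  ],
};
const contentChunk = {
  choices: [{ delta: { content: "The answer is 42." }, finish_reason: null }],
};
```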

Streaming (handleStream) — replaced the default handler with a bespoke implementation modeled after the existing DeepSeek/Foundry handlers (a condensed sketch follows this list):

  • Reads both delta.content and delta.reasoning_content per chunk.
  • On the first reasoning chunk, prepends a <think> tag and streams subsequent reasoning tokens through verbatim.
  • On the first content token after a reasoning run, emits the closing </think> tag, flushes the buffered reasoning into fullText, and resumes normal content streaming.
  • Preserves the existing usage-metric handling, abort handling, and finish_reason termination logic from the default handler.
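
A condensed sketch of that flow. Helper names and the import path are assumptions modeled on the repo's other provider handlers, and the real implementation also wires up the abort handling and usage metrics noted above:

```js
const { v4: uuidv4 } = require("uuid");
// `writeResponseChunk` mirrors the SSE helper the other handlers use;
// this import path is an assumption for the sketch.
const { writeResponseChunk } = require("../../helpers/chat/responses");

async function handleStream(response, stream, responseProps) {
  const { uuid = uuidv4(), sources = [] } = responseProps;
  let fullText = "";
  let reasoningText = "";

  const send = (textResponse, close = false) =>
    writeResponseChunk(response, {
      uuid,
      sources,
      type: "textResponseChunk",
      textResponse,
      close,
      error: false,
    });

  for await (const chunk of stream) {
    const message = chunk?.choices?.[0];
    const token = message?.delta?.content;
    const reasoningToken = message?.delta?.reasoning_content;

    if (reasoningToken) {
      // First reasoning token opens the <think> block; later ones stream through verbatim.
      const text =
        reasoningText.length === 0 ? `<think>${reasoningToken}` : reasoningToken;
      reasoningText += text;
      send(text);
      continue;
    }

    if (token) {
      // First content token after a reasoning run: close the block and
      // fold the buffered reasoning into fullText.
      if (reasoningText.length > 0) {
        send("</think>");
        fullText += `${reasoningText}</think>`;
        reasoningText = "";
      }
      fullText += token;
      send(token);
    }

    // Terminate on finish_reason, as the default handler does.
    if (message?.finish_reason) break;
  }

  send("", true); // final close event
  return fullText;
}
```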

Non-streaming (getChatCompletion) — added a #parseReasoningFromResponse helper that, when message.reasoning_content is present, prepends <think>{reasoning}</think> to the response content before returning it as textResponse. Same shape the streaming path produces, so downstream rendering is consistent across both modes.
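
A minimal sketch of that helper. The guard details are assumptions; `message` is the chat.completions message object returned by LM Studio's endpoint:

```js
class LMStudioLLM {
  // Sketch of the helper described above, mirroring the DeepSeek-style shape.
  #parseReasoningFromResponse({ message }) {
    let textResponse = message?.content;
    // Prepend reasoning in the same <think>...</think> wrapper the streaming
    // path emits so downstream rendering is identical in both modes.
    if (!!message?.reasoning_content && message.reasoning_content.trim().length > 0)
      textResponse = `<think>${message.reasoning_content}</think>${textResponse}`;
    return textResponse;
  }
}
```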

Visuals (if applicable)

Additional Information

Developer Validations

  • I ran yarn lint from the root of the repo & committed changes
  • Relevant documentation has been updated (if applicable)
  • I have tested my code functionality
  • Docker build succeeds locally

angelplusultra linked an issue May 6, 2026 that may be closed by this pull request
angelplusultra marked this pull request as ready for review May 6, 2026 23:19
angelplusultra requested a review from shatfield4 May 6, 2026 23:19
@shatfield4 (Collaborator) left a comment


Tested using deepseek/deepseek-r1-0528-qwen3-8b and everything worked as it should. LGTM.

@RailKill commented May 11, 2026

Should the reasoning_content implementation be moved into handleDefaultStreamResponseV2 so that it covers other providers? I think it should be fine since there are if-checks and guards. The GenericOpenAI provider is still using the old code and has TODO comments about using a shared function; that's the point of handleDefaultStreamResponseV2, right?

I'm experiencing this problem with the LocalAI provider, and I'm sure Ollama and other providers have the same issue, since they all use OpenAI-compatible endpoints and need the same fix. I think this would be a good opportunity to do that.

@angelplusultra (Contributor, Author) commented May 11, 2026

> Should the reasoning_content implementation be moved into handleDefaultStreamResponseV2 so that it covers other providers? I think it should be fine since there are if-checks and guards. The GenericOpenAI provider is still using the old code and has TODO comments about using a shared function; that's the point of handleDefaultStreamResponseV2, right?
>
> I'm experiencing this problem with the LocalAI provider, and I'm sure Ollama and other providers have the same issue, since they all use OpenAI-compatible endpoints and need the same fix. I think this would be a good opportunity to do that.

Yes, there will be some sort of unified refactor for all these identical handleStream methods across providers. It most likely will result in a refactor of handleDefaultStreamResponseV2.
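
For illustration, the guard such a shared handler could use might look like this (hypothetical sketch, not code from this PR or the repo):

```js
// Hypothetical: how a shared handler like handleDefaultStreamResponseV2 could
// read both fields. Providers whose deltas never carry `reasoning_content`
// simply get an empty string back and skip the reasoning branch, so their
// behavior would be unchanged.
function extractDeltaTokens(chunk) {
  const delta = chunk?.choices?.[0]?.delta ?? {};
  return {
    content: delta.content ?? "",
    reasoning: delta.reasoning_content ?? "",
  };
}
```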



Development

Successfully merging this pull request may close these issues.

[BUG]: LM Studio Provider Does Not Present Reasoning Output

4 participants