fix: close stream after upstream ends to prevent client hang by octo-patch · Pull Request #186 · sqlchat/sqlchat

octo-patch · 2026-04-26T02:44:40Z

Fixes #176

Problem

When an LLM provider (e.g. Ollama) does not send a data: [DONE] sentinel at the end of the SSE stream, the ReadableStream created in src/pages/api/chat.ts was never explicitly closed. The controller.close() call was only reachable inside the SSE event handler when data === "[DONE]", so providers that omit this terminator left the stream permanently open. On the client side, the reader.read() loop in ConversationView then blocked indefinitely, causing the UI to appear frozen with the message stuck in LOADING state.

Two additional issues accompanied this:

When the last streaming chunk has an empty delta (finish_reason: "stop"), delta?.content is undefined. Passing undefined to TextEncoder.encode() can produce unexpected bytes in some runtimes instead of being a no-op.
If the stream was closed with an error (controller.error()), the unhandled promise rejection in the frontend left the message permanently stuck in LOADING.

Solution

src/pages/api/chat.ts

After the for await loop that exhausts the upstream response body, call controller.close() inside a try/catch. The catch silently ignores the case where [DONE] already closed the controller.
Guard the encoder.encode(text) / controller.enqueue(queue) calls behind an if (text) check so empty-delta chunks are skipped cleanly.

src/components/ConversationView/index.tsx

Wrap the while (!done) stream-reading loop in a try/catch. If the stream signals an error, the assistant message is marked FAILED with an explanatory message rather than remaining stuck in LOADING.

Testing

Verified with standard OpenAI streaming: the [DONE] path still closes the stream normally; the post-loop controller.close() is a no-op (caught and ignored).
For providers that omit [DONE], the stream now closes cleanly once all upstream bytes are consumed.

…qlchat#176) When an LLM provider (e.g. Ollama native API) does not send a [DONE] sentinel at the end of the stream, the ReadableStream in the API handler was never explicitly closed. This left the client's reader.read() loop waiting indefinitely, causing the UI to appear stuck. Two changes: - chat.ts: call controller.close() after exhausting the upstream body, wrapped in try/catch to silently ignore the case where [DONE] already closed it. Also guard the text encoding so undefined delta content (the final finish_reason chunk) is not encoded. - ConversationView: wrap the stream-reading loop in try/catch so that any stream error marks the message as FAILED instead of leaving it stuck in LOADING state. Co-Authored-By: Octopus <liyuan851277048@icloud.com>

Copilot

Pull request overview

Fixes a streaming hang where the backend ReadableStream could remain open if an upstream provider ends the response without sending the SSE data: [DONE] sentinel, and improves frontend handling when the stream errors so the UI doesn’t remain stuck in LOADING.

Changes:

Backend: skip encoding/enqueueing empty delta chunks; ensure the response stream is closed after the upstream body is fully consumed.
Frontend: wrap the stream-reading loop in try/catch and mark the assistant message as FAILED on read errors.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

File	Description
`src/pages/api/chat.ts`	Closes the server stream after exhausting upstream chunks; avoids encoding undefined/empty deltas.
`src/components/ConversationView/index.tsx`	Adds error handling around `reader.read()` loop to prevent indefinite `LOADING` state on stream errors.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-04-26T10:04:12Z

+      // Ensure the stream is closed after all upstream chunks are consumed.
+      // Some providers (e.g. Ollama native API) may not send a [DONE] sentinel,
+      // which would leave the ReadableStream open and the client hanging indefinitely.
+      try {
+        controller.close();


TextDecoder.decode(chunk) is used without { stream: true } while iterating arbitrary Uint8Array chunks. If a multibyte UTF-8 code point is split across chunks, decoding can introduce replacement characters and potentially break SSE parsing/JSON parsing. Consider decoding in streaming mode (decoder.decode(chunk, { stream: true })) and flushing once at the end (before closing) to handle boundary-split characters correctly.

Copilot · 2026-04-26T10:04:12Z

+        const { value, done: readerDone } = await reader.read();
+        if (value) {
+          const char = decoder.decode(value);
+          if (char) {
+            assistantMessage.content = assistantMessage.content + char;


TextDecoder.decode(value) is called without { stream: true } inside a loop reading arbitrary byte chunks. This can corrupt multibyte UTF-8 characters that span chunk boundaries. Use streaming decode (decoder.decode(value, { stream: true })) and flush after the loop to ensure correct character reconstruction.

Copilot · 2026-04-26T10:04:12Z

+    try {
+      while (!done) {
+        const { value, done: readerDone } = await reader.read();
+        if (value) {
+          const char = decoder.decode(value);
+          if (char) {
+            assistantMessage.content = assistantMessage.content + char;
+            messageStore.updateMessage(assistantMessage.id, {
+              content: assistantMessage.content,
+            });
+          }
        }
+        done = readerDone;
      }
-      done = readerDone;
+    } catch (error) {
+      messageStore.updateMessage(assistantMessage.id, {
+        content: assistantMessage.content || "Failed to receive response. Please check your API endpoint configuration.",
+        status: "FAILED",
+      });
+      return;


The reader lock is never released/canceled. In the error path you return inside catch, and in the success path you exit the loop without calling reader.releaseLock() (or reader.cancel() when failing). Wrapping the read loop in try/catch/finally and releasing/canceling the reader in finally will avoid leaving the stream locked and can help ensure the underlying connection is torn down promptly on errors.

octo-patch requested review from boojack and tianzhou as code owners April 26, 2026 02:44

tianzhou requested a review from Copilot April 26, 2026 10:01

Copilot started reviewing on behalf of tianzhou April 26, 2026 10:01 View session

Copilot AI reviewed Apr 26, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: close stream after upstream ends to prevent client hang#186

fix: close stream after upstream ends to prevent client hang#186
octo-patch wants to merge 1 commit into
sqlchat:mainfrom
octo-patch:fix/issue-176-stream-not-closing

octo-patch commented Apr 26, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Apr 26, 2026

Uh oh!

Copilot AI Apr 26, 2026

Uh oh!

Copilot AI Apr 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

octo-patch commented Apr 26, 2026

Problem

Solution

Testing

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Apr 26, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 26, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 26, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants