Skip to content

[BUG]: Korean UTF-8 characters are corrupted when querying via MCP SQL endpoint. #47

@Taylor0819

Description

@Taylor0819

Is there an existing issue for this?

  • I have searched the existing issues

Current Behavior

Korean UTF-8 characters are corrupted when querying via MCP SQL endpoint.

Same query returns correct data via direct SQL API (/api/2.0/sql/statements),
but returns corrupted characters via MCP API (/api/2.0/mcp/sql).

Direct SQL API result (correct):
["서울지점", "KR", "오프라인"]

MCP SQL API result (broken):
{"string_value": "������������"}

Expected Behavior

Korean characters should be preserved correctly in MCP SQL response,
same as direct SQL Statement API.

Steps To Reproduce:

  1. Have a table with Korean text data in Databricks
  2. Call MCP SQL endpoint with execute_sql_read_only tool
  3. Korean characters appear as "������" in response
  4. Same query via /api/2.0/sql/statements returns correct Korean text

Relevant log output:
// MCP API response (broken)
{"string_value":"������������"},{"string_value":"KR"},{"string_value":"������������"}

// Direct SQL API response (correct)
["서울지점", "KR", "오프라인"]

Steps To Reproduce

No response

Relevant log output or Exception details

Operating System

macOS

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions