Feat/bigquery datasource [ part - 1 ] #115

Merged
anoop-narang merged 9 commits into main from feat/bigquery-datasource
Feb 10, 2026
Conversation


@anoop-narang anoop-narang commented Feb 10, 2026

Add BigQuery as a native datasource

Adds BigQuery support as a native datasource, allowing users to connect to GCP BigQuery
projects, discover tables, and query them through DataFusion.

What's included

  • BigQuery datasource implementation — connects via service account credentials, discovers
    tables from INFORMATION_SCHEMA, and fetches data using BigQuery Jobs API with pagination and
    batched Arrow writes
  • Configurable region — supports cross-dataset discovery with a region parameter (defaults to
    "us"), or scoped discovery when a specific dataset is provided
  • Inline credential support — credentials_json in the connection config is automatically
    stored as a secret and linked to the connection, matching the existing flow for Postgres
    passwords

Connection API

  {
      "name": "my_bq",
      "source_type": "bigquery",
      "config": {
          "project_id": "my-gcp-project",
          "credentials_json": "{...service account JSON...}",
          "region": "US",
          "dataset": "my_dataset"
      }
  }

Alternatively, credentials can be pre-created as a secret and referenced via secret_name or
secret_id.
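
As a rough sketch, the connection config above might map onto a struct like the following (hypothetical names and shape; the actual definition lives in the datasource code). It illustrates the region behavior from the description: defaulting to "us" when the field is omitted, and optionally scoping discovery to a single dataset:

```rust
// Hypothetical mirror of the BigQuery connection config shown above.
// Field names are taken from the JSON payload; everything else is assumed.
struct BigQueryConfig {
    project_id: String,
    credentials_json: Option<String>,
    region: Option<String>,
    dataset: Option<String>,
}

impl BigQueryConfig {
    // Region defaults to "us" when omitted, per the PR description.
    fn effective_region(&self) -> String {
        self.region.clone().unwrap_or_else(|| "us".to_string())
    }

    // Discovery is scoped to a single dataset when one is provided,
    // otherwise it is cross-dataset for the whole region.
    fn is_scoped(&self) -> bool {
        self.dataset.is_some()
    }
}
```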

Add BigQuery as a native data source with table discovery and data
fetching via the gcp-bigquery-client crate. Includes Source enum
variant, credential support, and integration into the NativeFetcher.

Replace hardcoded "region-us" with a configurable region field on the
BigQuery source config, defaulting to "us" when omitted.

Paginate through all result pages via get_query_results instead of only
reading the first page. Buffer rows and flush in 10k-row batches to
bound memory usage, matching the postgres/mysql fetcher pattern.
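
The batched-flush pattern this commit describes can be sketched as follows (assumed names; the real fetcher converts buffered rows into Arrow record batches rather than strings, and pages via get_query_results):

```rust
// Sketch of the 10k-row batched flush pattern, matching the
// postgres/mysql fetcher behavior described in the commit message.
const BATCH_SIZE: usize = 10_000;

struct BatchWriter {
    buffer: Vec<String>,
    flushed_batches: usize,
}

impl BatchWriter {
    fn new() -> Self {
        Self { buffer: Vec::new(), flushed_batches: 0 }
    }

    // Buffer a row; flush once the buffer reaches BATCH_SIZE so memory
    // stays bounded no matter how many result pages the Jobs API returns.
    fn push(&mut self, row: String) {
        self.buffer.push(row);
        if self.buffer.len() >= BATCH_SIZE {
            self.flush();
        }
    }

    // Flush any remaining rows (called again after the last page).
    // In the real fetcher this would build and emit an Arrow RecordBatch.
    fn flush(&mut self) {
        if !self.buffer.is_empty() {
            self.flushed_batches += 1;
            self.buffer.clear();
        }
    }
}
```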

BigQuery returns ARRAY, STRUCT, and JSON columns as non-string JSON
values (arrays, objects). Previously these were silently dropped as
nulls, causing Arrow validation errors on non-nullable columns. Now
they are serialized to their JSON string representation.
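
The fix described in this commit can be illustrated with a minimal, self-contained sketch. The Cell enum below is a stand-in for the JSON value type the client crate actually returns; only the mapping logic (NULL stays None, plain strings pass through, structured values get serialized) reflects the commit message:

```rust
// Hypothetical stand-in for the values BigQuery can return in a cell.
#[derive(Debug)]
enum Cell {
    Null,
    Str(String),
    Number(f64),
    Array(Vec<Cell>),
}

// Map a cell to the string stored in the Arrow column. Previously,
// non-string values (arrays, objects) were dropped as None, tripping
// Arrow validation on non-nullable columns; now they are serialized.
fn cell_to_arrow_string(cell: &Cell) -> Option<String> {
    match cell {
        Cell::Null => None,
        Cell::Str(s) => Some(s.clone()),
        Cell::Number(n) => Some(n.to_string()),
        Cell::Array(items) => {
            let parts: Vec<String> = items
                .iter()
                .map(|c| cell_to_arrow_string(c).unwrap_or_else(|| "null".into()))
                .collect();
            Some(format!("[{}]", parts.join(",")))
        }
    }
}
```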
@anoop-narang anoop-narang marked this pull request as ready for review February 10, 2026 14:30
@anoop-narang
Contributor Author

thread 'test_update_table_sync_cache_invalidation' (3845) panicked at tests/caching_catalog_tests.rs:22:10:
Failed to start Redis container: Client(PullImage { descriptor: "redis:7-alpine", err: DockerResponseServerError { status_code: 500, message: "toomanyrequests: You have reached your unauthenticated pull rate limit. https://www.docker.com/increase-rate-limit" } })

@anoop-narang changed the title from "Feat/bigquery datasource" to "Feat/bigquery datasource part - 1" Feb 10, 2026
@anoop-narang changed the title from "Feat/bigquery datasource part - 1" to "Feat/bigquery datasource [ part - 1 ]" Feb 10, 2026
The connection handler only recognized password, token, and bearer_token
as inline credential fields. BigQuery's credentials_json was not being
extracted, stored as a secret, or linked to the connection.
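
A minimal sketch of that fix (field names are taken from the commit message; the handler's actual structure is assumed):

```rust
// Config keys treated as inline credentials: extracted from the
// connection config, stored as a secret, and linked to the connection.
// "credentials_json" is the addition for BigQuery.
const INLINE_CREDENTIAL_FIELDS: [&str; 4] =
    ["password", "token", "bearer_token", "credentials_json"];

fn is_inline_credential(field: &str) -> bool {
    INLINE_CREDENTIAL_FIELDS.contains(&field)
}
```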
@anoop-narang anoop-narang merged commit ab01651 into main Feb 10, 2026
8 checks passed
@anoop-narang anoop-narang deleted the feat/bigquery-datasource branch February 10, 2026 17:43