SNOW-2084165 Add dataframe operation lineage on SnowparkSQLException #3339
sfc-gh-aalam merged 95 commits into main from aalam-SNOW-2084165-add-error-trace
Conversation
🎉 Snyk checks have passed: the security/snyk and license/snyk checks are complete, and no issues have been found.
```python
        """Returns the batch_ids of the children of this node."""
        return get_dependent_bind_ids(self.stmt_cache[self.batch_id])

    def get_src(self) -> Optional[proto.SrcPosition]:
```
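For context, the lineage node discussed in this diff could be sketched roughly as below. The names `get_dependent_bind_ids`, `batch_id`, and `stmt_cache` come from the diff itself, but the `Stmt` class here is a simplified stand-in for `proto.Stmt`, not the actual Snowpark implementation:

```python
from typing import Dict, Set


class Stmt:
    """Simplified stand-in for proto.Stmt: records the IDs it depends on."""

    def __init__(self, dependent_bind_ids: Set[int]) -> None:
        self.dependent_bind_ids = dependent_bind_ids


def get_dependent_bind_ids(stmt: Stmt) -> Set[int]:
    """Return the bind IDs this statement depends on."""
    return stmt.dependent_bind_ids


class LineageNode:
    """A node in the DAG describing the lineage of a DataFrame."""

    def __init__(self, batch_id: int, stmt_cache: Dict[int, Stmt]) -> None:
        self.batch_id = batch_id
        self.stmt_cache = stmt_cache

    @property
    def children(self) -> Set[int]:
        """Return the batch_ids of the children of this node."""
        return get_dependent_bind_ids(self.stmt_cache[self.batch_id])
```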
In the hybrid client prototype we use a slightly different method to get the source location: we just use `inspect` to walk the stack to the appropriate frame. We have to do this because modin does not use any of the AST machinery, but it is also relatively straightforward.
I would like to use your debugging tool for snowpandas as well, but we may want to refactor this so it does not require any of the protobuf work.
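The stack-walking approach mentioned here could look something like the following. This is a hypothetical sketch, not the hybrid client's actual code; it assumes the first frame whose module is outside the `snowflake.snowpark` package is the user's call site:

```python
import inspect
from typing import Optional, Tuple


def get_user_source_location() -> Optional[Tuple[str, int]]:
    """Walk the call stack outward and return (filename, lineno) of the
    first frame that is not inside the snowflake.snowpark package,
    i.e. the user's code that triggered the current operation."""
    for frame_info in inspect.stack():
        module = frame_info.frame.f_globals.get("__name__", "")
        if not module.startswith("snowflake.snowpark"):
            return frame_info.filename, frame_info.lineno
    return None
```

Because it needs no AST or protobuf support, a helper like this works for modin/snowpandas code paths as well.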
What about using this function?
Essentially we seem to have three approaches to this problem. I'm *less* of a fan of the AST approach because it doesn't help pandas for this type of debugging, but it seems like we might be able to consolidate with the open telemetry approach.
```python
_enable_dataframe_trace_on_error = False


def configure_development_features(
```
This is similar to what I would expect; see my comment on #3380 (comment).
I'm thinking of the following to provide a unified debug config experience:
```python
@experimental(version="1.33.0")
def debug_config(
    *,
    enable_eager_schema_validation=False,
    enable_dataframe_trace_on_error=False,
)
```

When users want to enable these features, they do:

```python
import snowflake.snowpark.context

snowflake.snowpark.context.debug_config(enable_eager_schema_validation=True)
# or
snowflake.snowpark.context.debug_config(
    enable_eager_schema_validation=True,
    enable_dataframe_trace_on_error=True,
)
```
Good idea. @sfc-gh-jrose and I are aligned on the name. Let me add the @experimental decorator as well.
Could not import the decorator due to circular import issues, but I added a warning there instead.
src/snowflake/snowpark/context.py
```python
def configure_development_features(
    *,
    enable_dataframe_trace_on_error: bool = False,
```
I think we should default to True. That way users that want a basic development mode can call this function without any parameters.
```python
    """A node representing a dataframe operation in the DAG that represents the lineage of a DataFrame."""

    def __init__(self, batch_id: int, stmt_cache: Dict[int, proto.Stmt]) -> None:
        self.batch_id = batch_id
```
Nit: I would argue that this isn't meant to be a batch ID anymore. Within each Python session that imports the Snowpark module, each AST ID for a Table or DataFrame will be a UID.
Which Jira issue is this PR addressing? Make sure that there is an accompanying issue to your PR.
Fixes SNOW-2084165
Fill out the following pre-review checklist:
Please describe how your code solves the related issue.
See doc.