added random_landmarking support for precomputed distance/affinity #88

MattScicluna · 2025-12-18T18:54:11Z

close #87

coveralls · 2025-12-18T18:57:22Z

Pull Request Test Coverage Report for Build 20349244901

Details

0 of 0 changed or added relevant lines in 0 files are covered.
No unchanged relevant lines lost coverage.
Overall coverage remained the same at 0.0%

Totals
Change from base Build 18852513838:	0.0%
Covered Lines:	0
Relevant Lines:	0

💛 - Coveralls

Copilot

Pull request overview

This PR adds support for random landmarking with precomputed distance and affinity matrices, addressing issue #87. Previously, random landmarking only worked with raw data using Euclidean distance computations. Now users can provide precomputed distance or affinity matrices and use random landmarking for landmark-based dimensionality reduction.

Key changes:

- Extended `build_landmark_op()` to detect and handle precomputed matrices by using affinity-based clustering instead of distance-based clustering
- Added test coverage for random landmarking with precomputed affinity matrices
- Preserved existing functionality for non-precomputed graphs
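
The difference between the two clustering rules can be sketched in a few lines. This is a minimal illustration, not the code in `graphtools/graphs.py`: the helper name `assign_to_landmarks` and its `kind` parameter are hypothetical, and the toy matrix is made up for the example.

```python
import numpy as np


def assign_to_landmarks(matrix, landmark_indices, kind="affinity"):
    """Assign each sample to one landmark (hypothetical helper, not graphtools API).

    kind="affinity": pick the landmark with the largest affinity (argmax).
    kind="distance": pick the landmark with the smallest distance (argmin).
    """
    cols = np.asarray(matrix)[:, landmark_indices]
    if kind == "affinity":
        return cols.argmax(axis=1)
    return cols.argmin(axis=1)


# Toy symmetric affinity matrix (illustrative values only).
affinity = np.array(
    [
        [1.0, 0.8, 0.1],
        [0.8, 1.0, 0.2],
        [0.1, 0.2, 1.0],
    ]
)
landmarks = np.array([0, 2])

clusters_from_affinity = assign_to_landmarks(affinity, landmarks, kind="affinity")
# Treating 1 - affinity as a distance gives the same assignment via argmin.
clusters_from_distance = assign_to_landmarks(1.0 - affinity, landmarks, kind="distance")
```

The returned values are positions within `landmark_indices`, not sample indices, which matches how the test in this PR compares `G.clusters` against an `argmax` over the landmark columns.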

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

File	Description
graphtools/graphs.py	Modified `build_landmark_op()` method to check for precomputed matrices and use kernel-based affinity clustering (argmax) instead of distance-based clustering (argmin) when precomputed matrices are detected
test/test_random_landmarking.py	Added new test `test_random_landmarking_with_precomputed_affinity()` to verify random landmarking works correctly with precomputed affinity matrices

Copilot · 2025-12-18T18:58:23Z

test/test_random_landmarking.py

+def test_random_landmarking_with_precomputed_affinity():
+    """Random landmarking should work with precomputed affinity matrices"""
+    affinity = np.array(
+        [
+            [1.0, 0.8, 0.1, 0.0, 0.0, 0.0],
+            [0.8, 1.0, 0.2, 0.0, 0.0, 0.0],
+            [0.1, 0.2, 1.0, 0.9, 0.4, 0.0],
+            [0.0, 0.0, 0.9, 1.0, 0.5, 0.2],
+            [0.0, 0.0, 0.4, 0.5, 1.0, 0.9],
+            [0.0, 0.0, 0.0, 0.2, 0.9, 1.0],
+        ]
+    )
+    affinity = (affinity + affinity.T) / 2  # ensure symmetry
+    n_landmark = 3
+    random_state = 42
+
+    G = graphtools.Graph(
+        affinity,
+        precomputed="affinity",
+        n_landmark=n_landmark,
+        random_landmarking=True,
+        random_state=random_state,
+        knn=3,
+        thresh=0,
+    )
+
+    # Trigger landmark construction
+    _ = G.landmark_op
+
+    rng = np.random.default_rng(random_state)
+    landmark_indices = rng.choice(affinity.shape[0], n_landmark, replace=False)
+    expected_clusters = np.asarray(
+        G.kernel[:, landmark_indices].argmax(axis=1)
+    ).reshape(-1)
+
+    assert np.array_equal(G.clusters, expected_clusters)
+    assert G.transitions.shape == (affinity.shape[0], n_landmark)
+    assert G.landmark_op.shape == (n_landmark, n_landmark)
+


While this test covers precomputed="affinity", there is no test coverage for precomputed="distance". Since the implementation in graphs.py handles precomputed distance matrices by converting them to affinity matrices in build_kernel(), it would be valuable to add a similar test case that verifies random landmarking works correctly with precomputed distance matrices as well.

Copilot

Pull request overview

Copilot reviewed 2 out of 2 changed files in this pull request and generated 4 comments.

Copilot · 2025-12-18T19:20:54Z

test/test_random_landmarking.py

+def test_random_landmarking_with_precomputed_affinity():
+    """Random landmarking should work with precomputed affinity matrices"""
+    affinity = np.array(
+        [
+            [1.0, 0.8, 0.1, 0.0, 0.0, 0.0],
+            [0.8, 1.0, 0.2, 0.0, 0.0, 0.0],
+            [0.1, 0.2, 1.0, 0.9, 0.4, 0.0],
+            [0.0, 0.0, 0.9, 1.0, 0.5, 0.2],
+            [0.0, 0.0, 0.4, 0.5, 1.0, 0.9],
+            [0.0, 0.0, 0.0, 0.2, 0.9, 1.0],
+        ]
+    )
+    affinity = (affinity + affinity.T) / 2  # ensure symmetry
+    n_landmark = 3
+    random_state = 42
+
+    G = graphtools.Graph(
+        affinity,
+        precomputed="affinity",
+        n_landmark=n_landmark,
+        random_landmarking=True,
+        random_state=random_state,
+        knn=3,
+        thresh=0,
+    )
+
+    # Trigger landmark construction
+    _ = G.landmark_op
+
+    rng = np.random.default_rng(random_state)
+    landmark_indices = rng.choice(affinity.shape[0], n_landmark, replace=False)
+    expected_clusters = np.asarray(
+        G.kernel[:, landmark_indices].argmax(axis=1)
+    ).reshape(-1)
+
+    assert np.array_equal(G.clusters, expected_clusters)
+    assert G.transitions.shape == (affinity.shape[0], n_landmark)
+    assert G.landmark_op.shape == (n_landmark, n_landmark)
+


The test only covers dense precomputed affinity matrices. Consider adding test coverage for sparse precomputed affinity matrices to ensure the sparse matrix handling code path (lines 1210-1217 in graphs.py) works correctly with random landmarking.
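
As a starting point for such a test, the sketch below checks only the assignment step: an `argmax` over the landmark columns of a SciPy CSR matrix should agree with the dense result. It does not call graphtools, the landmark indices are chosen by hand rather than drawn from `random_state`, and it assumes SciPy is installed.

```python
import numpy as np
from scipy import sparse

# Same affinity values as the dense test in this PR.
dense_affinity = np.array(
    [
        [1.0, 0.8, 0.1, 0.0, 0.0, 0.0],
        [0.8, 1.0, 0.2, 0.0, 0.0, 0.0],
        [0.1, 0.2, 1.0, 0.9, 0.4, 0.0],
        [0.0, 0.0, 0.9, 1.0, 0.5, 0.2],
        [0.0, 0.0, 0.4, 0.5, 1.0, 0.9],
        [0.0, 0.0, 0.0, 0.2, 0.9, 1.0],
    ]
)
sparse_affinity = sparse.csr_matrix(dense_affinity)

# Hand-picked landmarks for illustration (assumption: not what random_state=42 selects).
landmark_indices = np.array([0, 3, 5])

dense_clusters = dense_affinity[:, landmark_indices].argmax(axis=1)

# Sparse argmax may return a 2-D matrix, so flatten it the same way the PR's test does.
sparse_clusters = np.asarray(
    sparse_affinity[:, landmark_indices].argmax(axis=1)
).reshape(-1)
```

A full test would presumably pass `sparse_affinity` to `graphtools.Graph(..., precomputed="affinity", random_landmarking=True)` and repeat the assertions from the dense test.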

Copilot · 2025-12-18T19:20:55Z

test/test_random_landmarking.py

+def test_random_landmarking_with_precomputed_distance():
+    """Random landmarking should work with precomputed distance matrices"""
+    dist = np.array(
+        [
+            [0, 1, 4, 4, 4, 4],
+            [1, 0, 4, 4, 4, 4],
+            [4, 4, 0, 1, 4, 4],
+            [4, 4, 1, 0, 4, 4],
+            [4, 4, 4, 4, 0, 1],
+            [4, 4, 4, 4, 1, 0],
+        ]
+    )
+
+    n_landmark = 3
+    random_state = 42
+
+    G = graphtools.Graph(
+        dist,
+        precomputed="distance",
+        n_landmark=n_landmark,
+        random_landmarking=True,
+        random_state=random_state,
+        bandwidth=1,  # deterministic affinity: exp(-dist)
+        decay=1,
+        thresh=0,
+        knn=3,
+    )
+
+    # Trigger landmark construction
+    _ = G.landmark_op
+
+    rng = np.random.default_rng(random_state)
+    landmark_indices = rng.choice(dist.shape[0], n_landmark, replace=False)
+    expected_clusters = np.asarray(
+        G.kernel[:, landmark_indices].argmax(axis=1)
+    ).reshape(-1)
+
+    assert np.array_equal(G.clusters, expected_clusters)
+    assert G.transitions.shape == (dist.shape[0], n_landmark)
+    assert G.landmark_op.shape == (n_landmark, n_landmark)
+


The test only covers dense precomputed distance matrices. Consider adding test coverage for sparse precomputed distance matrices to ensure the sparse matrix code path works correctly with random landmarking.

Copilot · 2025-12-18T19:20:55Z

test/test_random_landmarking.py

+def test_random_landmarking_with_precomputed_affinity():
+    """Random landmarking should work with precomputed affinity matrices"""
+    affinity = np.array(
+        [
+            [1.0, 0.8, 0.1, 0.0, 0.0, 0.0],
+            [0.8, 1.0, 0.2, 0.0, 0.0, 0.0],
+            [0.1, 0.2, 1.0, 0.9, 0.4, 0.0],
+            [0.0, 0.0, 0.9, 1.0, 0.5, 0.2],
+            [0.0, 0.0, 0.4, 0.5, 1.0, 0.9],
+            [0.0, 0.0, 0.0, 0.2, 0.9, 1.0],
+        ]
+    )
+    affinity = (affinity + affinity.T) / 2  # ensure symmetry
+    n_landmark = 3
+    random_state = 42
+
+    G = graphtools.Graph(
+        affinity,
+        precomputed="affinity",
+        n_landmark=n_landmark,
+        random_landmarking=True,
+        random_state=random_state,
+        knn=3,
+        thresh=0,
+    )
+
+    # Trigger landmark construction
+    _ = G.landmark_op
+
+    rng = np.random.default_rng(random_state)
+    landmark_indices = rng.choice(affinity.shape[0], n_landmark, replace=False)
+    expected_clusters = np.asarray(
+        G.kernel[:, landmark_indices].argmax(axis=1)
+    ).reshape(-1)
+
+    assert np.array_equal(G.clusters, expected_clusters)
+    assert G.transitions.shape == (affinity.shape[0], n_landmark)
+    assert G.landmark_op.shape == (n_landmark, n_landmark)
+


Consider adding a test case for the edge case where some samples have zero affinity to all randomly selected landmarks. This would test the warning logic implemented in lines 1223-1228 of graphs.py, which is currently not covered by these tests.
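
The edge case can be reproduced without graphtools. The sketch below is an assumption about what such a check might look like: the helper `assign_with_zero_check`, the warning category, and the message are all hypothetical, and the actual logic in `graphs.py` is not shown in this thread.

```python
import warnings

import numpy as np


def assign_with_zero_check(kernel, landmark_indices):
    """Assign samples to landmarks and flag samples with no affinity to any landmark.

    Hypothetical helper; the warning text is illustrative, not the one in graphs.py.
    """
    cols = np.asarray(kernel)[:, landmark_indices]
    isolated = np.flatnonzero(cols.max(axis=1) == 0)
    if isolated.size > 0:
        warnings.warn(
            f"{isolated.size} sample(s) have zero affinity to all landmarks; "
            "argmax assigns them to the first landmark.",
            RuntimeWarning,
        )
    return cols.argmax(axis=1), isolated


# Two disconnected blocks: samples 0-1 and samples 2-3.
block_affinity = np.array(
    [
        [1.0, 0.8, 0.0, 0.0],
        [0.8, 1.0, 0.0, 0.0],
        [0.0, 0.0, 1.0, 0.7],
        [0.0, 0.0, 0.7, 1.0],
    ]
)

# Both landmarks fall in the first block, so samples 2 and 3 are isolated.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    clusters, isolated = assign_with_zero_check(block_affinity, np.array([0, 1]))
```

Note that `argmax` on an all-zero row returns index 0, so isolated samples silently land in the first landmark's cluster unless a check like this one flags them.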

Copilot · 2025-12-18T19:20:55Z

graphtools/graphs.py

+                precomputed = getattr(self, "precomputed", None)
+
+                if precomputed is not None:
+                    # Use the precomputed affinities/distances directly to avoid Euclidean fallback


The comment could be clearer. It says "Use the precomputed affinities/distances directly" but the code actually uses self.kernel, which is always an affinity matrix (distances are converted to affinities in build_kernel). Consider updating to "Use affinities from the kernel computed from the precomputed matrix" for clarity.

Suggested change

-                    # Use the precomputed affinities/distances directly to avoid Euclidean fallback
+                    # Use affinities from the kernel computed from the precomputed matrix to avoid Euclidean fallback
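
To see why working from the kernel is safe for precomputed distances, consider a simplified conversion. The sketch assumes an affinity of the form `exp(-(d / bandwidth) ** decay)`, which with `bandwidth=1` and `decay=1` reduces to the `exp(-dist)` noted in the distance test. It ignores any adaptive bandwidth, thresholding, or symmetrization that `build_kernel()` may apply, and the helper name and landmark indices are hypothetical.

```python
import numpy as np


def distance_to_affinity(dist, bandwidth=1.0, decay=1.0):
    """Simplified distance-to-affinity conversion (assumption, not build_kernel itself)."""
    return np.exp(-((np.asarray(dist, dtype=float) / bandwidth) ** decay))


# Same distance values as the precomputed-distance test in this PR.
dist = np.array(
    [
        [0, 1, 4, 4, 4, 4],
        [1, 0, 4, 4, 4, 4],
        [4, 4, 0, 1, 4, 4],
        [4, 4, 1, 0, 4, 4],
        [4, 4, 4, 4, 0, 1],
        [4, 4, 4, 4, 1, 0],
    ]
)
affinity_from_dist = distance_to_affinity(dist)

# Hand-picked landmarks for illustration (one per block).
landmark_indices = np.array([0, 2, 4])

# The conversion is monotonically decreasing, so the nearest landmark by distance
# is also the landmark with the highest affinity.
by_distance = dist[:, landmark_indices].argmin(axis=1)
by_affinity = affinity_from_dist[:, landmark_indices].argmax(axis=1)
```

Because the mapping preserves order, an `argmax` over kernel affinities selects the same landmark an `argmin` over the original distances would, as long as there are no ties and the kernel is a monotone function of distance.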

MattScicluna · 2025-12-18T19:30:31Z

@copilot open a new pull request to apply changes based on the comments in this thread

Copilot · 2025-12-18T19:30:40Z

@MattScicluna I've opened a new pull request, #89, to work on those changes. Once the pull request is ready, I'll request review from you.

added random_landmarking support for precomputed distance/affinity

2d65034

MattScicluna requested review from bjoaofelipe and Copilot December 18, 2025 18:54

Copilot started reviewing on behalf of MattScicluna December 18, 2025 18:54

MattScicluna mentioned this pull request Dec 18, 2025

Random landmarking incompatible with precomputed affinities #87

Open

Copilot AI reviewed Dec 18, 2025

added random landmarking precomputed distance test

45c96ca

MattScicluna requested a review from Copilot December 18, 2025 19:16

Copilot started reviewing on behalf of MattScicluna December 18, 2025 19:17

Copilot AI reviewed Dec 18, 2025

Initial plan

4be342a

Copilot AI mentioned this pull request Dec 18, 2025

Add sparse matrix and edge case test coverage for random landmarking with precomputed matrices #89

Merged

Copilot AI and others added 2 commits December 18, 2025 19:35

Add test coverage for sparse matrices and zero affinity edge case, im…

78f0c89

…prove comment clarity Co-authored-by: MattScicluna <19255250+MattScicluna@users.noreply.github.com>

Merge pull request #89 from KrishnaswamyLab/copilot/sub-pr-88

eb30e4a

Add sparse matrix and edge case test coverage for random landmarking with precomputed matrices
