-
Notifications
You must be signed in to change notification settings - Fork 2
Bulk embed and pools #11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Co-authored-by: jdavid10001 <jdavid10001@gmail.com>
|
Cursor Agent can help with this pull request. Just |
|
@cursoragent please start with changes. Looking forward to seeing your work. |
|
Thanks for the go-ahead! I’ve implemented the bulk-embed feature set:
Tests run: If you’d like, I can run the full DB-backed bulk spec suite or prepare a commit/PR summary. Learn more about Cursor Agents |
Add bulk embedding functionality with a new job, run-tracking collection, endpoint, and admin "Embed all" button to enable efficient backfilling and flexible ingest modes.
payloadcms-vectorize:bulk-embed-alltask handles selecting missing-version docs, delegating to provider callbacks (or the in-process fallback), polling, and reconciling embeddings.bulkto skip realtime vectorization, clear stale embeddings on updates, and rely on the bulk pipeline./api/vector-bulk-embedto queue a run.vector-bulk-embeddings-runs, added the bulk endpoint, updated dev config/helpers for bulk callbacks, and documentedingestMode+ bulk callbacks.