FE-611 | Add Vector Index Feature #21793

Open

bluepal-nadeem-abdun wants to merge 16 commits into devel from feature/FE-611/add-new-vector-index-ui

bluepal-nadeem-abdun commented Jun 2, 2025

Scope & Purpose

✨ Feature

Adds support for the new experimental vector index to the collection Indexes view in the core DB UI. The index appears as "Vector index (beta)" in the dropdown, and is only shown when the server is started with the --experimental-vector-index flag.
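A minimal sketch of how this kind of gating could look in the UI, assuming the server reports the index type names it supports and the dropdown is filtered against that list. The names `IndexTypeOption`, `ALL_INDEX_OPTIONS`, and `visibleIndexOptions`, as well as the labels other than "Vector index (beta)", are hypothetical and not taken from this PR's code.

```typescript
// Hypothetical shape of one dropdown entry (not the PR's actual type).
interface IndexTypeOption {
  value: string;
  label: string;
}

// Illustrative list of every option the UI knows how to render.
const ALL_INDEX_OPTIONS: IndexTypeOption[] = [
  { value: "persistent", label: "Persistent index" },
  { value: "inverted", label: "Inverted index" },
  { value: "vector", label: "Vector index (beta)" }
];

// Keep only the options whose type name the server reports as supported.
// Assumption: without --experimental-vector-index, "vector" is absent
// from the list the server returns.
function visibleIndexOptions(
  all: IndexTypeOption[],
  supportedByServer: string[]
): IndexTypeOption[] {
  return all.filter(option => supportedByServer.indexOf(option.value) !== -1);
}
```

With this approach the UI needs no knowledge of the startup flag itself; it only reflects what the server advertises, which is why the backend fix in the related PR matters.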

Checklist

Tests
- Manually tested
📖 CHANGELOG entry made

Related Information

Jira ticket: https://arangodb.atlassian.net/browse/FE-611
Related PR (backend fix): Fix supported indexes API #21792
Note: This backend PR addresses a bug where the vector index appears even without the --experimental-vector-index flag. This PR can proceed with a mention of the backend PR as an upstream dependency.


          FE-611 | Add Vector Index Feature

7490ccb

bluepal-nadeem-abdun requested review from palashkaria, Simran-B, cmyk47 and shd8

June 2, 2025 11:56

bluepal-nadeem-abdun requested a review from a team as a code owner

June 2, 2025 11:57

cla-bot bot added the cla-signed label


          FE-611 | Updated Change Log File

b135d2e

cmyk47 previously requested changes

...ps/system/_admin/aardvark/APP/react/src/views/collections/indices/useSupportedIndexTypes.tsx (outdated, resolved)

.../system/_admin/aardvark/APP/react/src/views/collections/indices/CollectionIndicesContext.tsx (outdated, resolved)

...pps/system/_admin/aardvark/APP/react/src/views/collections/indices/addIndex/AddIndexForm.tsx (outdated, resolved)

...pps/system/_admin/aardvark/APP/react/src/views/collections/indices/addIndex/AddIndexForm.tsx (outdated, resolved)

...ps/system/_admin/aardvark/APP/react/src/views/collections/indices/useSupportedIndexTypes.tsx (outdated, resolved)

bluepal-nadeem-abdun added 4 commits

June 2, 2025 21:11


          FE-611 | Initial Review Comments Addressed

190b21e


          Updated ChangeLog File

6aa22dd


          Merge branch 'devel' of https://github.com/arangodb/arangodb into feature/FE-611/add-new-vector-index-ui

bc51fef


          Updated ChangeLog File

ee93ff5

KVS85 requested changes

...ps/system/_admin/aardvark/APP/react/src/views/collections/indices/useSupportedIndexTypes.tsx (resolved)

CHANGELOG (resolved)

bluepal-nadeem-abdun added 3 commits

June 6, 2025 09:52


          Updated ChangeLog File

f13c1b4


          Merge branch 'devel' of https://github.com/arangodb/arangodb into feature/FE-611/add-new-vector-index-ui


          Updated ChangeLog File

cb1d9f1

cmyk47 dismissed their stale review

June 6, 2025 07:18

fixed

Simran-B reviewed

...ardvark/APP/react/src/views/collections/indices/addIndex/vectorIndex/useCreateVectorIndex.ts (outdated, resolved)

...ardvark/APP/react/src/views/collections/indices/addIndex/vectorIndex/useCreateVectorIndex.ts (outdated, resolved)

...ardvark/APP/react/src/views/collections/indices/addIndex/vectorIndex/useCreateVectorIndex.ts (outdated, resolved)

...ardvark/APP/react/src/views/collections/indices/addIndex/vectorIndex/useCreateVectorIndex.ts (resolved)

...ardvark/APP/react/src/views/collections/indices/addIndex/vectorIndex/useCreateVectorIndex.ts (outdated, resolved)

...ardvark/APP/react/src/views/collections/indices/addIndex/vectorIndex/useCreateVectorIndex.ts (outdated, resolved)

bluepal-nadeem-abdun added 4 commits

June 9, 2025 11:44


          FE-611 | Secondary Review Comments Addressed

3dbabd1


          Updated ChangeLog File

2c950b8


          Merge branch 'devel' of https://github.com/arangodb/arangodb into feature/FE-611/add-new-vector-index-ui

b15d4ea


          Updated ChangeLog File

a6a8bcc

bluepal-nadeem-abdun requested a review from KVS85

June 9, 2025 06:30


          FE-611 | Secondary Review Comments Addressed

994493a

Simran-B reviewed

...ardvark/APP/react/src/views/collections/indices/addIndex/vectorIndex/useCreateVectorIndex.ts Outdated

+                  label: "Index Factory",
+                  name: "params.factory",
+                  type: "text",
+                  tooltip: `Defines the FAISS index factory. Must start with "IVF". Example: IVF100_HNSW10,Flat. The number in IVF must match nLists (e.g. IVF100 → nLists = 100).`

Contributor

Simran-B Jun 10, 2025

Suggested change

-                tooltip: `Defines the FAISS index factory. Must start with "IVF". Example: IVF100_HNSW10,Flat. The number in IVF must match nLists (e.g. IVF100 → nLists = 100).`
+                tooltip: `Defines the FAISS index factory. Must start with "IVF". Example: IVF100_HNSW10,Flat. The number following "IVF" must match nLists (e.g. IVF100 → nLists = 100).`

Author

bluepal-nadeem-abdun Jun 11, 2025

Resolved the suggested change. Please have a look and let me know if further adjustments are needed.
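For illustration, the rule stated in the tooltip (the factory string starts with "IVF" and the number following it equals nLists) could be checked on the client roughly as follows. This is a sketch only: the function name `checkFactoryMatchesNLists` is hypothetical, and it assumes the number follows "IVF" immediately, as in the tooltip's example. It does not validate the rest of the FAISS factory string.

```typescript
// Returns an error message, or null when the factory string is
// consistent with nLists according to the tooltip's rule.
function checkFactoryMatchesNLists(factory: string, nLists: number): string | null {
  // Assumption: the list count directly follows the "IVF" prefix,
  // as in the tooltip's example "IVF100_HNSW10,Flat".
  const match = /^IVF(\d+)/.exec(factory);
  if (match === null) {
    return 'The factory must start with "IVF" followed by a number.';
  }
  const ivfCount = Number(match[1]);
  if (ivfCount !== nLists) {
    return `IVF${ivfCount} does not match nLists = ${nLists}.`;
  }
  return null;
}
```

For example, `checkFactoryMatchesNLists("IVF100_HNSW10,Flat", 100)` returns `null`, while passing `50` as nLists returns a mismatch message.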

Simran-B reviewed

...ardvark/APP/react/src/views/collections/indices/addIndex/vectorIndex/useCreateVectorIndex.ts Outdated

+                  label: "Default Number of Probes",
+                  name: "params.defaultNProbe",
+                  type: "number",
+                  tooltip: "The number of inverted lists (clusters) to search during queries by default. Increasing this value improves recall at the cost of speed. Default is 1."

Contributor

Simran-B Jun 10, 2025

Suggested change

-                tooltip: "The number of inverted lists (clusters) to search during queries by default. Increasing this value improves recall at the cost of speed. Default is 1."
+                tooltip: "The number of inverted lists (clusters) to search during queries by default. Increasing this value improves recall at the cost of speed. The default is 1."

Author

bluepal-nadeem-abdun Jun 11, 2025

Resolved the suggested change. Please have a look and let me know if further adjustments are needed.

Simran-B reviewed

...ardvark/APP/react/src/views/collections/indices/addIndex/vectorIndex/useCreateVectorIndex.ts Outdated

+                  name: "params.nLists",
+                  type: "number",
+                  isRequired: true,
+                  tooltip: "The number of Voronoi cells (nLists) to partition the vector space into. A higher value improves recall but increases indexing time. The value must not exceed the number of documents. Suggested: sqrt(N) / 15, where N is the number of documents."

Contributor

Simran-B Jun 10, 2025

As per the FAISS paper, the suggested value is 15 to 20 times the square root of the number of documents. The resulting number can be higher than allowed (with a factor of 15, if you have fewer than 225 docs), but I think this is already described adequately.

Suggested change

-                tooltip: "The number of Voronoi cells (nLists) to partition the vector space into. A higher value improves recall but increases indexing time. The value must not exceed the number of documents. Suggested: sqrt(N) / 15, where N is the number of documents."
+                tooltip: "The number of Voronoi cells (nLists) to partition the vector space into. A higher value improves recall but increases indexing time. The value must not exceed the number of documents. Suggested: 15 * sqrt(N), where N is the number of documents."

Author

bluepal-nadeem-abdun Jun 11, 2025

Resolved the suggested change. Please have a look and let me know if further adjustments are needed.
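The corrected suggestion (15 * sqrt(N)) and the constraint that nLists must not exceed the number of documents can be combined in a small helper. This is only a sketch: `suggestNLists` is a hypothetical name, and rounding to the nearest integer is an assumption. For small collections the raw suggestion exceeds N (with a multiplier of 15 this happens when N is below 225), so the result is capped at N.

```typescript
// Suggests a starting value for nLists from the document count.
// multiplier defaults to 15; the review comment cites a range of 15 to 20.
function suggestNLists(documentCount: number, multiplier: number = 15): number {
  if (!Number.isFinite(documentCount) || documentCount < 1) {
    throw new RangeError("documentCount must be at least 1");
  }
  // Assumption: round to the nearest whole number of lists.
  const raw = Math.round(multiplier * Math.sqrt(documentCount));
  // nLists must not exceed the number of documents, and must be at least 1.
  return Math.max(1, Math.min(raw, Math.floor(documentCount)));
}
```

For 10,000 documents this yields 1500; for 100 documents the raw value of 150 is capped at 100.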

Simran-B reviewed

...ardvark/APP/react/src/views/collections/indices/addIndex/vectorIndex/useCreateVectorIndex.ts Outdated

+                  label: "Training Iterations",
+                  name: "params.trainingIterations",
+                  type: "number",
+                  tooltip: "The number of iterations to use during index training. More iterations improve cluster quality and accuracy, but increase training time. Default is 25."

Contributor

Simran-B Jun 10, 2025

Suggested change

-                tooltip: "The number of iterations to use during index training. More iterations improve cluster quality and accuracy, but increase training time. Default is 25."
+                tooltip: "The number of iterations to use during index training. More iterations improve cluster quality and accuracy, but increase training time. The default is 25."

Author

bluepal-nadeem-abdun Jun 11, 2025

Resolved the suggested change. Please have a look and let me know if further adjustments are needed.

Simran-B requested changes

Contributor

Simran-B left a comment

The vector index supports the following options (not persisted / returned by the server), similar to other index types:

inBackground (see below)
parallelism: The number of threads to use for indexing. The default is 2.

The only other index with a parallelism option seems to be the inverted index, but we don't display it in the UI. I'm not sure if this is a conscious decision or not. I believe we found that 2 is a bit faster than 1 but that any larger number performed worse than 2, so there is little point in changing it for the inverted index. It could be very different for the vector index, though.

I noticed that e.g. for the persistent index, we don't have a particularly useful tooltip for Create in background:

My suggestion is to change it everywhere to something like this:

Enable this option to keep the collection/shards available for write operations by not using an exclusive write lock for the duration of the index creation.
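To make the two creation-time options concrete, here is a hedged sketch of how a vector index definition carrying `inBackground` and `parallelism` next to the `params` object might be assembled. The `params` field names `nLists`, `defaultNProbe`, `trainingIterations`, and `factory` come from the form fields reviewed above; `metric`, `dimension`, the field name `embedding`, the helper name `buildVectorIndexRequest`, and the `inBackground` default of `false` are assumptions for illustration. The `parallelism` default of 2 is the one stated in this comment.

```typescript
// Parameters stored with the index. metric and dimension are assumed here.
interface VectorIndexParams {
  metric: string;
  dimension: number;
  nLists: number;
  defaultNProbe?: number;
  trainingIterations?: number;
  factory?: string;
}

// Creation-time options that are not persisted or returned by the server.
interface VectorIndexCreateOptions {
  inBackground?: boolean;
  parallelism?: number;
}

interface VectorIndexRequest {
  type: "vector";
  fields: string[];
  params: VectorIndexParams;
  inBackground: boolean;
  parallelism: number;
}

// Hypothetical helper: merges form values into one index definition.
function buildVectorIndexRequest(
  field: string,
  params: VectorIndexParams,
  options: VectorIndexCreateOptions = {}
): VectorIndexRequest {
  return {
    type: "vector",
    fields: [field],
    params,
    // Assumed default: hold the exclusive lock unless the user opts in.
    inBackground: options.inBackground ?? false,
    // Default of 2 threads, as stated in the review comment.
    parallelism: options.parallelism ?? 2
  };
}

const example = buildVectorIndexRequest(
  "embedding",
  { metric: "cosine", dimension: 128, nLists: 100 },
  { inBackground: true }
);
```

Keeping the two options outside `params` mirrors the point made above: they affect only how the index is built and are not part of the stored definition.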

bluepal-nadeem-abdun added 2 commits

June 10, 2025 18:44


          FE-611 | Review Comments Addressed

f94291b


          FE-611 | Review Comments Addressed - Added Parallelism And InBackground Feilds

85f1fa3

bluepal-nadeem-abdun requested a review from Simran-B

June 12, 2025 10:34

Reviewers

cmyk47 left review comments

Awaiting requested review from palashkaria

Awaiting requested review from shd8

Awaiting requested review from KVS85

Awaiting requested review from Simran-B

Requested changes must be addressed to merge this pull request.

Labels