
Conversation

Contributor

@ashwinb ashwinb commented Oct 11, 2025

Applies the same pattern from #3777 to embeddings and vector_stores.create() endpoints.

This should not be a breaking change: (a) our tests were already passing the `extra_body` parameter to the backend, but (b) the backend probably wasn't extracting those parameters correctly. This PR fixes that.

Updated APIs: openai_embeddings(), openai_create_vector_store(), openai_create_vector_store_file_batch()

…s APIs

Applies the same pattern from #3777 to embeddings and vector_stores.create() endpoints.

Breaking change: Method signatures now accept a single params object with Pydantic extra="allow" instead of individual parameters. Provider-specific params can be passed via extra_body and accessed through params.model_extra.

Updated APIs: openai_embeddings(), openai_create_vector_store(), openai_create_vector_store_file_batch()
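
For illustration, a minimal sketch of that pattern, assuming a Pydantic params model; the class and field names below are hypothetical, not the actual llama-stack definitions:

```python
from pydantic import BaseModel, ConfigDict


class VectorStoreCreateParams(BaseModel):
    """Hypothetical params object; extra="allow" preserves unknown fields."""

    model_config = ConfigDict(extra="allow")

    name: str | None = None


async def openai_create_vector_store(params: VectorStoreCreateParams):
    # Fields the client sent via extra_body are not declared above, so they
    # show up in params.model_extra instead of as regular attributes.
    extra = params.model_extra or {}
    embedding_model = extra.get("embedding_model")
    return embedding_model
```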
@meta-cla meta-cla bot added the CLA Signed label on Oct 11, 2025
@ashwinb ashwinb changed the title feat(api)!: support passing extra_body to embeddings and vector_stores APIs feat(api): support passing extra_body to embeddings and vector_stores APIs Oct 12, 2025
VectorIORouter was still using the old individual-parameter signature instead of the new params object. Updated both openai_create_vector_store and openai_create_vector_store_file_batch to match the API protocol.
@ashwinb ashwinb changed the title feat(api): support passing extra_body to embeddings and vector_stores APIs feat(api): support extra_body to embeddings and vector_stores APIs Oct 12, 2025
…thods

VectorDBsRoutingTable was removed in a165b8b, so VectorIORouter needs to get the provider directly using routing_table.get_provider_impl() before calling provider methods, consistent with how insert_chunks() already works.
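
A hedged sketch of that delegation; whether get_provider_impl() is awaited, and what it is keyed by here, are assumptions rather than the exact llama-stack signatures:

```python
class VectorIORouter:  # trimmed to the one relevant, illustrative method
    async def openai_create_vector_store_file_batch(self, params):
        # With VectorDBsRoutingTable gone, resolve the provider implementation
        # directly from the routing table, the same way insert_chunks() does.
        provider = await self.routing_table.get_provider_impl(params.vector_store_id)
        return await provider.openai_create_vector_store_file_batch(params)
```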
# Extract llama-stack-specific parameters from extra_body
extra = params.model_extra or {}
embedding_model = extra.get("embedding_model")
embedding_dimension = extra.get("embedding_dimension", 384)
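
For reference, roughly how a caller could supply those fields through extra_body with an OpenAI-compatible client; the base URL and embedding model name below are placeholders, not values from this PR:

```python
from openai import OpenAI

# Placeholder endpoint/key; point this at a running llama-stack server.
client = OpenAI(base_url="http://localhost:8321/v1/openai/v1", api_key="none")

vector_store = client.vector_stores.create(
    name="my-docs",
    # llama-stack-specific fields ride along in extra_body and surface
    # server-side via params.model_extra, as shown above.
    extra_body={
        "embedding_model": "all-MiniLM-L6-v2",
        "embedding_dimension": 384,
    },
)
```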
Collaborator

do we still want this default?

Collaborator

i guess so for tests

Contributor Author

@franciscojavierarceo yeah I should kill that and see what breaks

@ashwinb ashwinb changed the title feat(api): support extra_body to embeddings and vector_stores APIs feat(api)!: support extra_body to embeddings and vector_stores APIs Oct 13, 2025
Collaborator

@franciscojavierarceo franciscojavierarceo left a comment

lgtm

Contributor Author

ashwinb commented Oct 13, 2025

Green finally. Corresponding llama-stack-client changes: llamastack/llama-stack-client-python#280

@ashwinb ashwinb merged commit ecc8a55 into main Oct 13, 2025
22 checks passed
@ashwinb ashwinb deleted the embeddings_extra_body branch October 13, 2025 02:01
jwm4 pushed a commit to jwm4/llama-stack that referenced this pull request Oct 13, 2025
…lamastack#3794)

