Releases: chroma-core/chroma
Latest
Version: 1.0.13.dev5
Git ref: refs/heads/main
Build Date: 2025-05-31T12:33
PIP Package: chroma-1.0.13.dev5.tar.gz
Github Container Registry Image: :1.0.13.dev5
DockerHub Image: :1.0.13.dev5
1.0.12
Version: 1.0.12
Git ref: refs/tags/1.0.12
Build Date: 2025-05-31T02:41
PIP Package: chroma-1.0.12.tar.gz
Github Container Registry Image: :1.0.12
DockerHub Image: :1.0.12
What's Changed
- [ENH] Add rust log service memberlist by @HammadB in #4415
- [BUG] Wire up the rust-log-service member ID to the dirty log. by @rescrv in #4645
- [CLN] Failed precondition -> Internal by @HammadB in #4646
- [BUG] Bump testing back to one rust log service replica. by @rescrv in #4648
- [ENH][mdac] Demonstrate and test how to give per-tenant exclusions. by @rescrv in #4639
- [BUG]: don't create duplicate segments for collection under concurrent get_or_create requests by @codetheweb in #4647
- [BUG] Use the right path for my_member_id in rust-log-service. by @rescrv in #4652
- [DOC] fix docs for list_collections by @jairad26 in #4649
- [ENH] Cleanup the single most spammy log line in rls. by @rescrv in #4654
- [ENH]: sysdb changes to support moving collection hard deletes to garbage collector by @codetheweb in #4607
- [ENH]: serde util for optional u128 by @c-gamble in #4659
- [BUG]: root collection ID should be null if empty string by @codetheweb in #4663
- [ENH] Clean up client manager into manager/assigner - make log client use it by @HammadB in #4640
- [ENH] Make explicit seal/migrate calls for the log service. by @rescrv in #4669
- [ENH]: add readiness probe for garbage collector by @codetheweb in #4662
- [ENH]: perform collection hard deletes from garbage collector by @codetheweb in #4605
- [TST] Regex testing by @Sicheng-Pan in #4570
- [ENH]: soft delete databases, add
FinishDatabaseDeletion
gRPC method to hard delete databases by @codetheweb in #4627 - [BLD]: fix typo so that test durations are reported properly by @codetheweb in #4674
- [ENH]: wire GC v2 to new cleanup modes & call FinishDatabaseDeletion from garbage collector by @codetheweb in #4671
- [ENH] Robustly handle errors on the compaction path of the log. by @rescrv in #4677
- [BUG] Raise Error when can't deserialize configuration json from server, lazily load ef on CollectionModel, warn on api_key by @jairad26 in #4471
- [ENH] Add logging statements to figure out why forking is failing in staging. by @rescrv in #4679
- [ENH] Prefetch block by prefixes by @Sicheng-Pan in #4623
- [ENH] Add validation when multiple embedding functions set on client by @jairad26 in #4507
- [BLD]: parallelize cross version test by @codetheweb in #4675
- [ENH]: don't skip GC for collection in fork tree if using GC v2 by @codetheweb in #4681
- [ENH] add Mistral embedding function, fix some naming mismatches across clients by @jairad26 in #4653
- [ENH][chroma-load] Allow reference data sets to reference minilm6v2. by @rescrv in #4687
- [RELEASE] 1.0.12 by @jairad26 in #4685
New Contributors
Full Changelog: 1.0.11...1.0.12
1.0.10
Version: 1.0.10
Git ref: refs/tags/1.0.10
Build Date: 2025-05-22T03:00
PIP Package: chroma-1.0.10.tar.gz
Github Container Registry Image: :1.0.10
DockerHub Image: :1.0.10
What's Changed
- [CHORE] Use bigger machine for build and publish container image by @HammadB in #4533
- [ENH] Expose may_contain for disk cache, use in prefetch by @HammadB in #4530
- [BUG]: use byte offset in full-text reader rather than token position by @codetheweb in #4531
- [ENH]: Add spann metrics by @sanketkedia in #4492
- [ENH]: Add name to various foyer caches by @sanketkedia in #4532
- [ENH]:
ListCollectionsToGc
returns lineage file path, group by fork tree, accept optional tenant for filtering by @codetheweb in #4529 - [BUG]: increase max payload size of log service (Go) by @codetheweb in #4534
- [ENH] Fix 422 Unprocessable Entity error in Chroma Load by @rescrv in #4537
- [CHORE] Reduce default NUM_FORKS quota rule value by @drewkim in #4525
- Revert "[BUG]: increase max payload size of log service (Go) (#4534)" by @eculver in #4540
- [CHORE] Fix test-embeddings.py and test_sanity by @rescrv in #4526
- [BUG] Default embedding function in JS did not create the correct config dict by @jairad26 in #4539
- [ENH] Emit log_uncompacted_record_count from rust log service. by @rescrv in #4541
- [DBG] Add temporary trace for filter argument by @Sicheng-Pan in #4548
- [ENH] Support custom data sets for chroma load. by @rescrv in #4547
- [BUG] Restore increase max payload size of log service (Go) by @eculver in #4546
- [BUG]: v0 in version file for forked collection missing segment file paths from source collection by @codetheweb in #4550
- [ENH] Add log sealing to the go service. by @rescrv in #4554
- [BUG]: Regex search returning documents that are deleted by @sanketkedia in #4562
- [BUG] Add hosted-frontend to OTel stdout layer by @drewkim in #4564
- [BUG]: Handle version file creation with empty file paths by @sanketkedia in #4552
- [BLD] Do not trigger deploy on merge to release branches by @eculver in #4561
- [ENH] For local: use subquery for FTS, unions for int & float metadata expr, is true -> is not null by @jairad26 in #4556
- [ENH] Add a safety cutoff to the rust log service. by @rescrv in #4535
- [ENH] Add handling for ChromaQuotaExceededError by @gshahbazian in #4569
- [ENH] bump JS client to v2.4.5 by @philipithomas in #4574
- [CHORE] Remove validation on ef config update for model path by @jairad26 in #4571
- [ENH] Add a tool to reason through the state space of bootstrap. by @rescrv in #4558
- [RELEASE] CLI 1.1.2 by @itaismith in #4572
- [ENH] Bootstrap a wal3 log from existing content. by @rescrv in #4560
- [BUG] Fix race in get_or_create by @HammadB in #4568
- [BLD]: use blacksmith pull-through container image cache for Tilt cluster by @codetheweb in #4542
- [ENH]: SysDb should return lineage, version file paths and root collection ID on collections by @codetheweb in #4557
- [BLD]: use uv in CI by @codetheweb in #4581
- [BLD] retry kind setup by @codetheweb in #4584
- [RELEASE] 1.0.10 by @itaismith in #4587
Full Changelog: 1.0.9...1.0.10
cli-1.1.2
CLI release.
1.0.9
Version: 1.0.9
Git ref: refs/tags/1.0.9
Build Date: 2025-05-13T03:39
PIP Package: chroma-1.0.9.tar.gz
Github Container Registry Image: :1.0.9
DockerHub Image: :1.0.9
What's Changed
- [CHORE] Fix JS release to support platform specific bindings by @itaismith in #4442
- [RELEASE] Release JS client 2.4.1 by @itaismith in #4443
- [RELEASE] JS Client 2.4.2 by @itaismith in #4444
- [CLN] Warn on log pull failure instead of error by @HammadB in #4435
- [RELEASE] JS Client 2.4.3 by @itaismith in #4448
- [TST] Debug cli tests by @itaismith in #4453
- [ENH] replace group_by and having with distinct and where for local chroma by @jairad26 in #4428
- [CHORE] Update JS bindings CI by @itaismith in #4445
- [ENH] add example of forking using github repos by @jairad26 in #4413
- [DOC] Add docs for CLI-1.1.0 by @itaismith in #4440
- [DOC] Add JS install for the CLI by @itaismith in #4459
- [ENH]: Add retry for add, update and upsert by @sanketkedia in #4457
- [ENH] Add load service to local dev tiltfile by @jasonvigil in #4397
- [CLN] Pass Chroma API key via env var for chroma-load-start by @jasonvigil in #4469
- [ENH] A route and tool to inspect the dirty log. by @rescrv in #4461
- [ENH] Change rust-log-service to a stateful set. by @rescrv in #4477
- [ENH]: Introduce NUM_REGEX_PREDICATES by @sanketkedia in #4429
- [CLN] Cleanup a warning in js_bindings/Cargo.toml. by @rescrv in #4481
- [BUG] Remove misleading log message by @jasonvigil in #4480
- [ENH] Add caching to rust log service. by @rescrv in #4465
- [BUG] Fix CLI update command by @itaismith in #4482
- [ENH]: When SPANN is enabled, route HNSW configuration to SPANN, remove enable_set_index_params by @sanketkedia in #4470
- [ENH] Add RPC on SysDB to get fork count for a collection by @drewkim in #4484
- [BUG] Retry chroma-load upserts when rate limited by @jasonvigil in #4485
- [CLN] CLI - Refactor webbrowser commands by @itaismith in #4455
- [BUG] Restore commented-out test. by @rescrv in #4478
- [ENH][chroma-load] If no api key is present in env, do not auth. by @rescrv in #4486
- [ENH] Configure the cache to have a hostPath and mountPath for log service. by @rescrv in #4483
- [ENH]: Unique constraint violation when two writes contend for collection table on log by @sanketkedia in #4494
- [ENH] write embedding function to config if provided by @jairad26 in #4496
- [ENH] Disallow empty string ids during add by @jairad26 in #4488
- [BUG] Adjust the ping-pong test for wal3 to timeout longer and run less. by @rescrv in #4495
- [ENH] Allow collections to shunt to the alt log by tenant. by @rescrv in #4498
- [ENH][rust-log-service] Use the persistent cache. by @rescrv in #4499
- [CHORE] Support count_forks/num_forks in TestSysDB/Quotas by @drewkim in #4497
- [ENH] Display errors for all log calls. by @rescrv in #4504
- [ENH]: Add quota on regex pattern length by @sanketkedia in #4503
- [BUG] Bump buggy deps to satisfy dependabot by @rescrv in #4505
- [ENH] Make metadata optional to allow deleting metadata in python client by @jairad26 in #4490
- [BLD]: unify Rust Dockerfiles for faster builds by @codetheweb in #4479
- [ENH] QuotaExceededError can include an optional message by @philipithomas in #4489
- [RELEASE] CLI 1.1.1 by @itaismith in #4509
- [CLN] fork_collection quota cleanup by @drewkim in #4508
- [BLD]: fix secret usage for Go test reusable workflow by @codetheweb in #4510
- [ENH] Implement literal expression for regex by @Sicheng-Pan in #4421
- [CHORE] Reimplement fork backstop by @drewkim in #4511
- [ENH] Implement literal provider by @Sicheng-Pan in #4430
- [ENH] Wireup regex in filter operator by @Sicheng-Pan in #4452
- [BUG] Fix blockfile range scan by @Sicheng-Pan in #4467
- [CHORE] Add a log line to go server to see if its scout logs is called by @rescrv in #4514
- [CLN] Rename $matches to $regex by @Sicheng-Pan in #4506
- [ENH] Reorganize load-service traces by @jasonvigil in #4518
- [RELEASE] 1.0.9 by @HammadB in #4523
Full Changelog: 1.0.8...1.0.9
1.0.8
Version: 1.0.8
Git ref: refs/tags/1.0.8
Build Date: 2025-05-05T09:26
PIP Package: chroma-1.0.8.tar.gz
Github Container Registry Image: :1.0.8
DockerHub Image: :1.0.8
What's Changed
- [ENH] add collection forking to js client by @philipithomas in #4372
- [BUG] Fix race between hnsw load and hnsw purge by @sanketkedia in #4345
- [RELEASE] js client v2.3.0 with collection config and forking by @philipithomas in #4373
- [ENH] Make connection configurable per running load-service workload by @jasonvigil in #4374
- [BUG] Handle uninitialized spann segment reader by @sanketkedia in #4348
- [TST] Add property test for collection forking by @Sicheng-Pan in #4375
- [ENH] Fixed Dockerfile for the current genertion of chroma project by @j0hnthebuilder in #4270
- [BUG] Ann calculation in tests after collection config + handle none in collection config by @sanketkedia in #4350
- [ENH] Turn on spann by default by @sanketkedia in #4351
- [CHORE] Fix flaky default_ef test by @jairad26 in #4376
- [CLN]: Move hnsw_provider.open() lock inside the call itself by @sanketkedia in #4383
- [CHORE] Move distributed to default collection configuration by @jairad26 in #4377
- [CHORE] Fix type safety by @jairad26 in #4339
- [ENH] Add Cloudflare Worker AI Embedding Function by @jairad26 in #4389
- [BUG]: Fix Voyage AI EF to match new API by @tazarov in #4392
- [DOC]: Fix wrong cohere import in collection config docs by @tazarov in #4393
- [ENH] Track collection forking events by @Sicheng-Pan in #4395
- [CHORE] remove hnsw-lib from core dependencies by @jairad26 in #4385
- [ENH] Add python & js client support to query on subset of IDs by @jairad26 in #4250
- [ENH]: Huggingface Embedding Server authorization by @MrExplode in #4227
- [ENH]: better errors in
ComputeUnusedFilesOperator
by @codetheweb in #4400 - [ENH] Add Together AI Embedding function in Python and JS by @jairad26 in #4394
- [BUG]: don't try reading HNSW files as sparse indices when performing garbage collection on SPANN collections by @codetheweb in #4401
- [BUG] Prevent unconditional unwrap of collection config by @jasonvigil in #4404
- [BUG]: fix missing tenant ID in version file by @codetheweb in #4408
- [BUG] Make a strongly-consistent get operation. by @rescrv in #4407
- [BUG] Tag prefetch record as IO, fix spinloop in admissionController by @HammadB in #4405
- [TST] Re-enable parallel spann tests with bigger stack size by @sanketkedia in #4409
- [ENH] Wire up regex filter from client to query node by @Sicheng-Pan in #4410
- [CLN] Fix Port Parameter Inconsistency in HttpClient (String to Integer) by @iw4p in #3662
- [BUG] Fix the boundaries for snapshots. by @rescrv in #4411
- [CLN] Remove legacy python FE from tilt by @HammadB in #4414
- [ENH] Fork the wal3 log. by @rescrv in #4416
- [ENH] Enable rust log service in CI and add some tracing. by @rescrv in #4418
- [ENH]: Enable auth for forking by @sanketkedia in #4419
- [BUG] You must ask for strong consistency from S3 GET. by @rescrv in #4420
- [ENH] CLI client refactor by @itaismith in #4417
- [BUG] Handle duplicates in chroma-load. by @rescrv in #4423
- [BUG] ignore trigrams with null terminator byte when constructing full text index by @codetheweb in #4422
- [ENH] Restore chroma-load's random-upsert ability for Reference datasets. by @rescrv in #4425
- [CLN] Use GRPC for chroma-load OTEL by @jasonvigil in #4427
- [ENH] Add support to convert Get/QueryResult to pandas dataframe by @jairad26 in #4304
- [ENH] CLI Browse command by @itaismith in #4431
- [RELEASE] Release CLI 1.1.0 by @itaismith in #4438
- [RELEASE] JS release 2.4.0 by @itaismith in #4439
- [RELEASE] 1.0.8 by @itaismith in #4441
New Contributors
- @j0hnthebuilder made their first contribution in #4270
- @MrExplode made their first contribution in #4227
- @iw4p made their first contribution in #3662
Full Changelog: 1.0.7...1.0.8
cli-1.1.0
CLI release.
1.0.7
Version: 1.0.7
Git ref: refs/tags/1.0.7
Build Date: 2025-04-25T00:33
PIP Package: chroma-1.0.7.tar.gz
Github Container Registry Image: :1.0.7
DockerHub Image: :1.0.7
What's Changed
- [BLD] Switch to blacksmith runners by @HammadB in #4337
- [DOC] Update DeepEval Links by @kritinv in #4344
- [BLD] Setup rust in pypi release, standardize protoc action by @HammadB in #4349
- [BUG]: Setting max_lifetime and max_timeout to None for in-memory DB by @tazarov in #4347
- [BLD] Setup rust, correct var name for token by @HammadB in #4352
- [ENH] Wire up proto defs for sysdb fork endpoint by @Sicheng-Pan in #4299
- [CHORE] Enable rust log service, set timeouts, change tilt. by @rescrv in #4338
- [ENH] Implement collection forking in SysDB by @Sicheng-Pan in #4303
- [ENH] Wire up collection forking for RFE by @Sicheng-Pan in #4309
- [CHORE] Increase default max batch size by @HammadB in #4353
- [ENH] Wire up collection forking for python by @Sicheng-Pan in #4314
- [ENH] Implement log forking by @Sicheng-Pan in #4326
- [CHORE] Make rust action set version to installed version by @HammadB in #4354
- [CHORE] Disable rust log service because it flaked once landing. by @rescrv in #4355
- [BUG] Fix log fork bugs by @Sicheng-Pan in #4356
- [ENH] Add VerifyingDataSet that verifies loaded data by @jasonvigil in #4341
- [ENH] Disable quota and add limit for collection fork by @Sicheng-Pan in #4359
- [CLN] Enable quota for fork by @HammadB in #4360
- [CHORE]: Discord - link to channel by @hesreallyhim in #4363
- [DOC]: Fix documentation around
$ne
and$nin
by @hesreallyhim in #4315 - [BUG] Fix lineage file path and root collection tracing by @Sicheng-Pan in #4361
- [CFG] Disable quota for forking by @Sicheng-Pan in #4364
- [BUG] Fix failing fork on empty collection by @Sicheng-Pan in #4366
- [DOC]: Fix broken Next/Prev link buttons by @hesreallyhim in #4310
- [ENH] Introduce + consume foyer.obtain() by @sanketkedia in #4362
- [DOC]: Fix: (small) fix broken links by @hesreallyhim in #4214
- [ENH] Add hook to initialize collection before running verification by @jasonvigil in #4369
- [ENH] Build chroma-load-start binary and add it to chroma-load container by @jasonvigil in #4370
- [RELEASE] 1.0.7 by @itaismith in #4371
Full Changelog: 1.0.6...1.0.7
1.0.6
Version: 1.0.6
Git ref: refs/tags/1.0.6
Build Date: 2025-04-22T05:44
PIP Package: chroma-1.0.6.tar.gz
Github Container Registry Image: :1.0.6
DockerHub Image: :1.0.6
What's Changed
- [BUG] Create version file on first flush by @Sicheng-Pan in #4284
- [ENH] Add root collection id and lineage file name in collection table by @Sicheng-Pan in #4287
- [BLD] Add test for JS client to check for drift in generated openapi spec by @jairad26 in #4298
- [BUG]: Fix foyer lodc buffer splitter bug by bumping version by @MrCroxx in #4107
- [ENH] Improve backoff and throttling of wal3. by @rescrv in #4302
- [BUG] Fix an ordering and windowing bug in new log service. by @rescrv in #4306
- [ENH] Update Jina embedding function to support all models and configurations by @jairad26 in #4244
- [ENH] Dynamic adjustment of priority for s3 gets + construct reader once + use rw lock by @sanketkedia in #4272
- [BUG] Handle null version file name in query by @Sicheng-Pan in #4311
- [ENH] grpc changes for update collection config by @jairad26 in #4083
- [ENH] return configuration in server response by @jairad26 in #4087
- [ENH] Add collection config to js client by @jairad26 in #4116
- [DOC] Add collection config docs by @jairad26 in #4300
- [ENH] Add Spann configuration to collection config by @jairad26 in #4195
- [BUG] disable alt-host shunt by @rescrv in #4319
- [CHORE] set verbosity for hypothesis to verbose. by @rescrv in #4322
- [ENH] Add user agent to rust frontend traces by @jasonvigil in #4323
- [CHORE] Fix typing on empty configuration json returned, remove validation on num_threads in python by @jairad26 in #4325
- [HOTFIX] Remove batch_size and num_threads on server response, remove client coll config validation by @jairad26 in #4329
- [BUG]: <100 items, >20% delete Wrong distance comparison in local HNSW by @tazarov in #4334
- [BUG] Return a rate limit error when scorecard trips. by @rescrv in #4313
- [DOC]: Fix docstring defaults for query filters by @hesreallyhim in #4282
- [ENH]: Setup Grafana dashboard for foyer by @MrCroxx in #4274
- [CHORE] Demote "Block: written to storage (<>B)" to debug by @rescrv in #4308
- [CHORE]: Local compation manager error propagation for sqlite by @tazarov in #4258
- Update ephemeral-client.md by @ericlabouve-sn in #3921
- [BUG] Segmentation fault in get_block by @sanketkedia in #4336
- [RELEASE] 1.0.6 by @itaismith in #4340
New Contributors
- @ericlabouve-sn made their first contribution in #3921
Full Changelog: 1.0.5...1.0.6
1.0.5
Version: 1.0.5
Git ref: refs/tags/1.0.5
Build Date: 2025-04-16T17:36
PIP Package: chroma-1.0.5.tar.gz
Github Container Registry Image: :1.0.5
DockerHub Image: :1.0.5
What's Changed
- [ENH]: allow specifying environment variables for garbage collector k8s template by @codetheweb in #4249
- [ENH] Docs - better image sizing + formats to improve pageload times by @philipithomas in #4251
- [ENH]: show garbage collector logs in stdout by @codetheweb in #4252
- [ENH] Add ttl to sysdb cache on RFE by @sanketkedia in #4241
- [TST]: enable Rust backtraces in Python binding tests by @codetheweb in #4253
- [CHORE] remove unused page, page_size, and sort args on get by @jairad26 in #4204
- [CHORE] Propogate error messages correctly to user by @jairad26 in #4235
- [TST]: log durations of 10 slowest Python tests by @codetheweb in #4256
- [ENH]: log OpenTelemetry export errors by @codetheweb in #4255
- Update link for C#/.NET library by @cincuranet in #3670
- [ENH] Make snapshot be recursive. by @rescrv in #4085
- [ENH] When recovering a wal3 manifest, mark it dirty. by @rescrv in #4119
- [BUG] Fix race condition. by @rescrv in #4260
- [ENH] Add a scout-logs function to find the max log position. by @rescrv in #4232
- [ENH] Retry query paths on transport errors by @Sicheng-Pan in #4262
- [ENH] Introduce request priority in storage layer by @sanketkedia in #4254
- [ENH]: Prefetch posting list in query + compactor by @sanketkedia in #4257
- [TST]: run test_cross_version_persist tests sequentially by @codetheweb in #4264
- [ENH]: set up rendezvous hashing for collection -> garbage collector node mapping by @codetheweb in #4113
- [ENH] Reenable wal3 by @rescrv in #4261
- [CLN] Deprecate delete list file in GC by @Sicheng-Pan in #4266
- [DOC]: Update improper usage of
$contains
operator in example code and add test to validate it by @hesreallyhim in #4096 - [ENH] Update QuotaExceededError status code and message by @drewkim in #4144
- [CHORE] Add test to validate embedding function error handling by @jairad26 in #4259
- [ENH] Add support for images in Cohere embedding function by @jairad26 in #4269
- [BUG] OBO between log service and compaction. by @rescrv in #4276
- [ENH] add regeneration script for js library by @jairad26 in #4115
- [ENH] Add some tracing spans to wal3. by @rescrv in #4283
- [CHORE] Remove js bindings from release by @jairad26 in #4285
- [ENH] fix discord link by @jeffchuber in #4286
- [ENH] clarify CLI version req for install command by @jeffchuber in #4289
- [BLD] remove chromadb-js-bindings from chromadb package to fix build by @philipithomas in #4291
- [RELEASE] 1.0.5 by @itaismith in #4294
New Contributors
- @cincuranet made their first contribution in #3670
- @hesreallyhim made their first contribution in #4096
Full Changelog: 1.0.4...1.0.5