Getting Started

Python

For production, Chroma offers Chroma Cloud - a fast, scalable, and serverless database-as-a-service. Get started in 30 seconds - $5 in free credits included.

Install

pip install chromadb

Create a Chroma Client

Python

import chromadb
chroma_client = chromadb.Client()

Create a collection

Collections are where you’ll store your embeddings, documents, and any additional metadata. Collections index your embeddings and documents, and enable efficient retrieval and filtering. You can create a collection with a name:

Python

collection = chroma_client.create_collection(name="my_collection")

Add some text documents to the collection

Chroma will store your text and handle embedding and indexing automatically. You can also customize the embedding model. You must provide unique string IDs for your documents.

Python

collection.add(
    ids=["id1", "id2"],
    documents=[
        "This is a document about pineapple",
        "This is a document about oranges"
    ]
)
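Because IDs must be unique strings, one option is to derive them from the document text itself. The sketch below is only an illustration: `make_ids` and the `doc-` prefix are hypothetical names, not part of Chroma, and hashing the text is one possible scheme among many.

```python
import hashlib


def make_ids(documents, prefix="doc"):
    """Derive a stable, unique string ID for each document from its text."""
    ids = []
    seen = set()
    for text in documents:
        # A short SHA-256 prefix gives the same ID for the same text on every run.
        digest = hashlib.sha256(text.encode("utf-8")).hexdigest()[:16]
        doc_id = f"{prefix}-{digest}"
        if doc_id in seen:
            raise ValueError(f"duplicate document text: {text!r}")
        seen.add(doc_id)
        ids.append(doc_id)
    return ids


documents = [
    "This is a document about pineapple",
    "This is a document about oranges",
]
ids = make_ids(documents)
# The IDs can then be passed alongside the documents, for example:
# collection.add(ids=ids, documents=documents)
```

Since the IDs depend only on the text, re-running the script produces the same IDs, which pairs well with `upsert`.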

Query the collection

You can query the collection with a list of query texts, and Chroma will return the n most similar results. It’s that easy!

Python

results = collection.query(
    query_texts=["This is a query document about hawaii"], # Chroma will embed this for you
    n_results=2 # how many results to return
)
print(results)

If n_results is not provided, Chroma will return 10 results by default. Here we only added 2 documents, so we set n_results=2.
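If you would rather not hard-code the number, you can compute it from the size of the collection. This is a minimal sketch: `effective_n_results` is a hypothetical helper, and the default of 10 is taken from the paragraph above.

```python
def effective_n_results(requested, available, default=10):
    """Pick how many results to ask for without exceeding what is stored."""
    n = default if requested is None else requested
    if n < 1:
        raise ValueError("n_results must be at least 1")
    # Never ask for more results than there are documents (but at least 1).
    return max(1, min(n, available))


# With only 2 documents stored, the default of 10 is reduced to 2.
n = effective_n_results(None, available=2)
# In a real script, `available` could come from collection.count():
# collection.query(query_texts=["..."], n_results=effective_n_results(None, collection.count()))
```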

Inspect Results

The output below shows that our query about hawaii is semantically most similar to the document about pineapple: it is listed first and has the smallest distance.

Python

{
  'documents': [[
      'This is a document about pineapple',
      'This is a document about oranges'
  ]],
  'ids': [['id1', 'id2']],
  'distances': [[1.0404009819030762, 1.243080496788025]],
  'uris': None,
  'data': None,
  'metadatas': [[None, None]],
  'embeddings': None,
}
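Each field in the result holds one inner list per query text, which is why the values are nested. The sketch below pairs up the IDs, documents, and distances for each query; `flatten_results` is a hypothetical helper, and the dictionary is copied from the output above rather than produced by a live query.

```python
def flatten_results(results):
    """Return, for each query text, a list of (id, document, distance) tuples."""
    per_query = []
    for ids, docs, dists in zip(
        results["ids"], results["documents"], results["distances"]
    ):
        per_query.append(list(zip(ids, docs, dists)))
    return per_query


# Sample output copied from the query above.
results = {
    "documents": [[
        "This is a document about pineapple",
        "This is a document about oranges",
    ]],
    "ids": [["id1", "id2"]],
    "distances": [[1.0404009819030762, 1.243080496788025]],
}

hits = flatten_results(results)
# A smaller distance means a closer match, so take the minimum for the first query.
best_id, best_doc, best_distance = min(hits[0], key=lambda hit: hit[2])
print(best_id, best_doc)  # prints: id1 This is a document about pineapple
```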

Try it out yourself

What if we tried querying with “This is a query document about florida”? Here is a full example.

Python

import chromadb
chroma_client = chromadb.Client()

# switch `create_collection` to `get_or_create_collection` to avoid creating a new collection every time
collection = chroma_client.get_or_create_collection(name="my_collection")

# switch `add` to `upsert` to avoid adding the same documents every time
collection.upsert(
    documents=[
        "This is a document about pineapple",
        "This is a document about oranges"
    ],
    ids=["id1", "id2"]
)

results = collection.query(
    query_texts=["This is a query document about florida"], # Chroma will embed this for you
    n_results=2 # how many results to return
)

print(results)

Next steps

In this guide we used Chroma’s ephemeral client for simplicity. It starts a Chroma server in-memory, so any data you ingest will be lost when your program terminates. You can use the persistent client or run Chroma in client-server mode if you need data persistence.
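As a rough sketch of how a script might choose between these modes, the example below picks a client from a small settings dictionary. The keys `CHROMA_HOST`, `CHROMA_PORT`, and `CHROMA_PATH` and both helper functions are hypothetical names used only for illustration; `chromadb.EphemeralClient`, `chromadb.PersistentClient`, and `chromadb.HttpClient` are the client constructors this sketch assumes.

```python
def client_config(settings):
    """Decide which kind of client to build from a dict of settings."""
    if settings.get("CHROMA_HOST"):
        return {
            "kind": "http",
            "host": settings["CHROMA_HOST"],
            "port": int(settings.get("CHROMA_PORT", "8000")),
        }
    if settings.get("CHROMA_PATH"):
        return {"kind": "persistent", "path": settings["CHROMA_PATH"]}
    return {"kind": "ephemeral"}


def make_client(config):
    """Build the Chroma client described by `config`."""
    # Imported here so client_config can be used without chromadb installed.
    import chromadb

    if config["kind"] == "http":
        # Client-server mode: connect to a separately running Chroma server.
        return chromadb.HttpClient(host=config["host"], port=config["port"])
    if config["kind"] == "persistent":
        # Data is written to disk under the given path and survives restarts.
        return chromadb.PersistentClient(path=config["path"])
    # In-memory only: data is lost when the program terminates.
    return chromadb.EphemeralClient()


config = client_config({"CHROMA_PATH": "./getting-started"})
# chroma_client = make_client(config)
```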

Learn how to Deploy Chroma to a server
Join Chroma’s Discord Community to ask questions and get help
Follow Chroma on X (@trychroma) for updates

TypeScript

Install

npm install chromadb @chroma-core/default-embed

Create a Chroma Client

Run the Chroma backend:

Terminal CLI

chroma run --path ./getting-started

Terminal Docker

docker pull chromadb/chroma
docker run -p 8000:8000 chromadb/chroma

Then create a client which connects to it:

TypeScript ESM

import { ChromaClient } from "chromadb";
const client = new ChromaClient();

TypeScript CJS

const { ChromaClient } = require("chromadb");
const client = new ChromaClient();

Create a collection

TypeScript

const collection = await client.createCollection({
  name: "my_collection",
});

Add some text documents to the collection

Chroma will store your text and handle embedding and indexing automatically. You can also customize the embedding model. You must provide unique string IDs for your documents.

TypeScript

await collection.add({
  ids: ["id1", "id2"],
  documents: [
    "This is a document about pineapple",
    "This is a document about oranges",
  ],
});

Query the collection

You can query the collection with a list of query texts, and Chroma will return the n most similar results. It’s that easy!

TypeScript

const results = await collection.query({
  queryTexts: ["This is a query document about hawaii"], // Chroma will embed this for you
  nResults: 2, // how many results to return
});

console.log(results);

If nResults is not provided, Chroma will return 10 results by default. Here we only added 2 documents, so we set nResults to 2.

Inspect Results

The output below shows that our query about hawaii is semantically most similar to the document about pineapple: it is listed first and has the smallest distance.

TypeScript

{
    documents: [
        [
            'This is a document about pineapple',
            'This is a document about oranges'
        ]
    ],
    ids: [
        ['id1', 'id2']
    ],
    distances: [[1.0404009819030762, 1.243080496788025]],
    uris: null,
    data: null,
    metadatas: [[null, null]],
    embeddings: null
}

Try it out yourself

What if we tried querying with “This is a query document about florida”? Here is a full example.

TypeScript

import { ChromaClient } from "chromadb";
const client = new ChromaClient();

// switch `createCollection` to `getOrCreateCollection` to avoid creating a new collection every time
const collection = await client.getOrCreateCollection({
  name: "my_collection",
});

// switch `add` to `upsert` to avoid adding the same documents every time
await collection.upsert({
  documents: [
    "This is a document about pineapple",
    "This is a document about oranges",
  ],
  ids: ["id1", "id2"],
});

const results = await collection.query({
  queryTexts: ["This is a query document about florida"], // Chroma will embed this for you
  nResults: 2, // how many results to return
});

console.log(results);

Next steps

We offer first-class support for various embedding providers via our embedding function interface. Each embedding function ships in its own npm package.
Learn how to Deploy Chroma to a server
Join Chroma’s Discord Community to ask questions and get help
Follow Chroma on X (@trychroma) for updates
