Featured AI Products
Compute
Build, deploy, and scale cloud compute resources
Containers and Images
Safely store and manage containers and backups
Managed Databases
Fully managed resources running popular database engines
Management and Dev Tools
Control infrastructure and gather insights
Networking
Secure and control traffic to apps
Security
Help protect your account and resources with these security features
Storage
Store and access any amount of data reliably in the cloud
Browse all products
AI/ML
CMS
Data and IoT
Developer Tools
Gaming and Media
Hosting
Security and Networking
Startups and SMBs
Web and App Platforms
See all solutions
Community
Documentation
Developer Tools
Get Involved
Utilities and Help
Become a Partner
Marketplace
Pricing

A More Powerful, Code-First Knowledge Base Experience on the DigitalOcean Gradient™ AI Platform

Updated: February 3, 2026
2 min read

Building production-ready retrieval-augmented generation (RAG) systems can be complex, time-consuming, and often requires months of engineering effort. Developers and enterprises struggle to ingest diverse data sources, structure content for semantic search, and maintain accurate, verifiable answers.

Enhancements to DigitalOcean Gradient™ AI Knowledge Bases, now in public preview, are designed to solve this problem. Its code-first feature lets developers create, manage, and query knowledge bases entirely from code, giving full control over ingestion, chunking, embedding, and retrieval without having to worry about the underlying infrastructure.

Flexible, production-ready toolkit

Many existing solutions let developers create a basic knowledge base, but they often struggle to scale, customize, or integrate it into production workflows. The improvements address this by providing a code-first, developer-focused toolkit that handles the full knowledge base lifecycle. Developers can ingest data from files, Dropbox, web crawlers, control chunking and embedding strategies, and run natural language queries that return citation-backed answers with metadata filters. With well-documented APIs and SDKs, these integrations are seamless, letting developers manage everything entirely in code.

What’s new in the public preview

The public preview highlights the essential tools developers need to build and manage knowledge bases effectively:

Direct API Access: Query knowledge bases directly without needing an agent, giving full control for integration into apps or RAG pipelines.
Customizable ingestion: Ingest content from supported sources such as files, web crawlers, and Dropbox datasets. Supports structured data, sitemap crawling, and accurate parsing of complex PDFs.
Flexible chunking and embedding: Choose the chunking strategy that fits your content and select from high-performance embedding models (including a multi-lingual embedding model). Intelligent defaults allow for quick setup. The latest update allows you to modify and update chunking strategies for both new and existing Knowledge Bases via the data source tab on the DO control Panel or via the API providing more flexibility and improved accuracy for existing content.
Advanced retrieval and citations: Run queries with exact-page citations, metadata filters, and hybrid search.
Developer-first tooling: Fully code-driven SDK and API functions makes creation and integration seamless.

Get started with the improved Knowledge Base experience today

The improvements are available in public preview. Start building smarter AI applications faster by managing your knowledge bases entirely in code. Explore the API documentation to start experimenting, and see how quickly you can turn your data into actionable, context-rich answers.

To explore the Knowledge Base improvements, enable the public preview on your Feature Preview page in the DigitalOcean Cloud Console. Once you’ve opted in, access will be granted within approximately 10–15 minutes.

About the author

Grace Morgan

Author

See author profile

Ai Ml

Start building today

From GPU-powered inference and Kubernetes to managed databases and storage, get everything you need to build, scale, and deploy intelligent applications.

AI/ML

Built for Mass Scale: Hard-Won Lessons from Teams Running High Volume Inference Workloads in Production

Hasan Nabulsi

July 2, 2026
5 min read

Product updates

Run Codex in the cloud – DigitalOcean for Codex is now available

Ari Sigal

June 25, 2026
3 min read

Engineering

The Inference Tax: How Prefix-Aware Routing Eliminates the Hidden Cost of LLMs at Scale

Piyush Srivastava

June 1, 2026
13 min read

AI/ML

A More Powerful, Code-First Knowledge Base Experience on the DigitalOcean Gradient™ AI Platform

By Grace Morgan

Updated: February 3, 2026
2 min read

<- Back to blog home

Flexible, production-ready toolkit

What’s new in the public preview

The public preview highlights the essential tools developers need to build and manage knowledge bases effectively:

Direct API Access: Query knowledge bases directly without needing an agent, giving full control for integration into apps or RAG pipelines.
Customizable ingestion: Ingest content from supported sources such as files, web crawlers, and Dropbox datasets. Supports structured data, sitemap crawling, and accurate parsing of complex PDFs.
Flexible chunking and embedding: Choose the chunking strategy that fits your content and select from high-performance embedding models (including a multi-lingual embedding model). Intelligent defaults allow for quick setup. The latest update allows you to modify and update chunking strategies for both new and existing Knowledge Bases via the data source tab on the DO control Panel or via the API providing more flexibility and improved accuracy for existing content.
Advanced retrieval and citations: Run queries with exact-page citations, metadata filters, and hybrid search.
Developer-first tooling: Fully code-driven SDK and API functions makes creation and integration seamless.

Get started with the improved Knowledge Base experience today

About the author

Grace Morgan

Author

See author profile

Ai Ml

Start building today

From GPU-powered inference and Kubernetes to managed databases and storage, get everything you need to build, scale, and deploy intelligent applications.

AI/ML

Built for Mass Scale: Hard-Won Lessons from Teams Running High Volume Inference Workloads in Production

Hasan Nabulsi

July 2, 2026
5 min read

Product updates

Run Codex in the cloud – DigitalOcean for Codex is now available

Ari Sigal

June 25, 2026
3 min read

Engineering

The Inference Tax: How Prefix-Aware Routing Eliminates the Hidden Cost of LLMs at Scale

Piyush Srivastava

June 1, 2026
13 min read

A More Powerful, Code-First Knowledge Base Experience on the DigitalOcean Gradient™ AI Platform

Flexible, production-ready toolkit

What’s new in the public preview

Get started with the improved Knowledge Base experience today

About the author

Start building today

Related Articles

Built for Mass Scale: Hard-Won Lessons from Teams Running High Volume Inference Workloads in Production

Run Codex in the cloud – DigitalOcean for Codex is now available

The Inference Tax: How Prefix-Aware Routing Eliminates the Hidden Cost of LLMs at Scale

A More Powerful, Code-First Knowledge Base Experience on the DigitalOcean Gradient™ AI Platform

Flexible, production-ready toolkit

What’s new in the public preview

Get started with the improved Knowledge Base experience today

About the author

Start building today

Related Articles

Built for Mass Scale: Hard-Won Lessons from Teams Running High Volume Inference Workloads in Production

Run Codex in the cloud – DigitalOcean for Codex is now available

The Inference Tax: How Prefix-Aware Routing Eliminates the Hidden Cost of LLMs at Scale