• Blog
  • Docs
  • Careers
  • Get Support
  • Contact Sales
DigitalOcean
  • Featured AI Products

    Compute

    Build, deploy, and scale cloud compute resources

    Containers and Images

    Safely store and manage containers and backups

    Managed Databases

    Fully managed resources running popular database engines

    Management and Dev Tools

    Control infrastructure and gather insights

    Networking

    Secure and control traffic to apps

    Security

    Help protect your account and resources with these security features

    Storage

    Store and access any amount of data reliably in the cloud

    Browse all products

  • AI/ML

    CMS

    Data and IoT

    Developer Tools

    Gaming and Media

    Hosting

    Security and Networking

    Startups and SMBs

    Web and App Platforms

    See all solutions

  • Community

    Documentation

    Developer Tools

    Get Involved

    Utilities and Help

  • Become a Partner

    Marketplace

  • Pricing
  • Log in
  • Sign up
  • Log in
  • Sign up

Company

  • About
  • Leadership
  • Blog
  • Careers
  • Customers
  • Partners
  • Referral Program
  • Affiliate Program
  • Press
  • Legal
  • Privacy Policy
  • Security
  • Investor Relations

Products

  • GPU Droplets
  • Bare Metal GPUs
  • Inference Engine
  • Data & Learning
  • Evaluations
  • Model Library
  • Droplets
  • Kubernetes
  • Functions
  • App Platform
  • Load Balancers
  • Managed Databases
  • Spaces
  • Block Storage
  • Network File Storage
  • API
  • Uptime
  • Cloud Security Posture Management (CSPM)
  • Identity and Access Management (IAM)
  • Cloudways
  • View all Products

Resources

  • Community Tutorials
  • Community Q&A
  • CSS-Tricks
  • Write for DOnations
  • Currents Research
  • DigitalOcean Startups
  • Wavemakers Program
  • Compass Council
  • Open Source
  • Newsletter Signup
  • Marketplace
  • Pricing
  • Pricing Calculator
  • Documentation
  • Release Notes
  • Code of Conduct
  • Shop Swag

Solutions

  • AI Training GPU
  • GPU Inference
  • VPS Hosting
  • Website Hosting
  • VPN
  • Docker Hosting
  • Node.js Hosting
  • Web Mobile Apps
  • WordPress Hosting
  • Virtual Machines
  • View all Solutions

Contact

  • Support
  • Sales
  • Report Abuse
  • System Status
  • Share your ideas

Company

  • About
  • Leadership
  • Blog
  • Careers
  • Customers
  • Partners
  • Referral Program
  • Affiliate Program
  • Press
  • Legal
  • Privacy Policy
  • Security
  • Investor Relations

Products

  • GPU Droplets
  • Bare Metal GPUs
  • Inference Engine
  • Data & Learning
  • Evaluations
  • Model Library
  • Droplets
  • Kubernetes
  • Functions
  • App Platform
  • Load Balancers
  • Managed Databases
  • Spaces
  • Block Storage
  • Network File Storage
  • API
  • Uptime
  • Cloud Security Posture Management (CSPM)
  • Identity and Access Management (IAM)
  • Cloudways
  • View all Products

Resources

  • Community Tutorials
  • Community Q&A
  • CSS-Tricks
  • Write for DOnations
  • Currents Research
  • DigitalOcean Startups
  • Wavemakers Program
  • Compass Council
  • Open Source
  • Newsletter Signup
  • Marketplace
  • Pricing
  • Pricing Calculator
  • Documentation
  • Release Notes
  • Code of Conduct
  • Shop Swag

Solutions

  • AI Training GPU
  • GPU Inference
  • VPS Hosting
  • Website Hosting
  • VPN
  • Docker Hosting
  • Node.js Hosting
  • Web Mobile Apps
  • WordPress Hosting
  • Virtual Machines
  • View all Solutions

Contact

  • Support
  • Sales
  • Report Abuse
  • System Status
  • Share your ideas
© 2026 DigitalOcean, LLC.Sitemap.
AI/ML

A More Powerful, Code-First Knowledge Base Experience on the DigitalOcean Gradient™ AI Platform

author

By Grace Morgan

  • Updated: February 3, 2026
  • 2 min read
<- Back to blog home

Building production-ready retrieval-augmented generation (RAG) systems can be complex, time-consuming, and often requires months of engineering effort. Developers and enterprises struggle to ingest diverse data sources, structure content for semantic search, and maintain accurate, verifiable answers.

Enhancements to DigitalOcean Gradient™ AI Knowledge Bases, now in public preview, are designed to solve this problem. Its code-first feature lets developers create, manage, and query knowledge bases entirely from code, giving full control over ingestion, chunking, embedding, and retrieval without having to worry about the underlying infrastructure.

Flexible, production-ready toolkit

Many existing solutions let developers create a basic knowledge base, but they often struggle to scale, customize, or integrate it into production workflows. The improvements address this by providing a code-first, developer-focused toolkit that handles the full knowledge base lifecycle. Developers can ingest data from files, Dropbox, web crawlers, control chunking and embedding strategies, and run natural language queries that return citation-backed answers with metadata filters. With well-documented APIs and SDKs, these integrations are seamless, letting developers manage everything entirely in code.

What’s new in the public preview

The public preview highlights the essential tools developers need to build and manage knowledge bases effectively:

  • Direct API Access: Query knowledge bases directly without needing an agent, giving full control for integration into apps or RAG pipelines.
  • Customizable ingestion: Ingest content from supported sources such as files, web crawlers, and Dropbox datasets. Supports structured data, sitemap crawling, and accurate parsing of complex PDFs.
  • Flexible chunking and embedding: Choose the chunking strategy that fits your content and select from high-performance embedding models (including a multi-lingual embedding model). Intelligent defaults allow for quick setup. The latest update allows you to modify and update chunking strategies for both new and existing Knowledge Bases via the data source tab on the DO control Panel or via the API providing more flexibility and improved accuracy for existing content.
  • Advanced retrieval and citations: Run queries with exact-page citations, metadata filters, and hybrid search.
  • Developer-first tooling: Fully code-driven SDK and API functions makes creation and integration seamless.

Get started with the improved Knowledge Base experience today

The improvements are available in public preview. Start building smarter AI applications faster by managing your knowledge bases entirely in code. Explore the API documentation to start experimenting, and see how quickly you can turn your data into actionable, context-rich answers.

To explore the Knowledge Base improvements, enable the public preview on your Feature Preview page in the DigitalOcean Cloud Console. Once you’ve opted in, access will be granted within approximately 10–15 minutes.

About the author

Grace Morgan
Grace Morgan
Author
See author profile
See author profile

Share

  • Ai Ml

Start building today

From GPU-powered inference and Kubernetes to managed databases and storage, get everything you need to build, scale, and deploy intelligent applications.
Sign up

Related Articles

Built for Mass Scale: Hard-Won Lessons from Teams Running High Volume Inference Workloads in Production
AI/ML

Built for Mass Scale: Hard-Won Lessons from Teams Running High Volume Inference Workloads in Production

Hasan Nabulsi
  • July 2, 2026
  • 5 min read

Read more

Run Codex in the cloud – DigitalOcean for Codex is now available
Product updates

Run Codex in the cloud – DigitalOcean for Codex is now available

Ari Sigal
  • June 25, 2026
  • 3 min read

Read more

The Inference Tax: How Prefix-Aware Routing Eliminates the Hidden Cost of LLMs at Scale
Engineering

The Inference Tax: How Prefix-Aware Routing Eliminates the Hidden Cost of LLMs at Scale

Piyush Srivastava
  • June 1, 2026
  • 13 min read

Read more