
Admin — Knowledge Bases

Knowledge Bases are Bedrock Knowledge Bases backed by Aurora pgvector. The admin UI lets operators create knowledge bases, upload documents directly to their underlying S3 bucket, trigger Bedrock’s ingestion job, and watch sync status without leaving the app.

Route: /knowledge-bases
File: apps/admin/src/routes/_authed/_tenant/knowledge-bases/index.tsx

A searchable DataTable with columns for:

| Column | Notes |
| --- | --- |
| Name | Display name + icon |
| Status | creating, ready, syncing, error |
| Docs | Total document count |
| Last Sync | Relative timestamp of the most recent Bedrock ingestion run |
| Description | Short description |

A “Create KB” button in the header opens the creation flow (not fully visible in this route; the empty-state CTA is the usual entry point for most operators). Clicking a row navigates to the detail page.

Route: /knowledge-bases/:kbId
File: apps/admin/src/routes/_authed/_tenant/knowledge-bases/$kbId.tsx

The detail page is a two-column grid: Documents on the left, Sync on the right, with a header showing the KB name, slug, status badges, and a delete confirmation flow.

The Documents card shows every file currently in the KB’s S3 bucket with:

  • File name
  • Size
  • Last modified timestamp
  • Delete button (per file)

An Upload button opens the file picker. Uploading uses a presigned URL pattern: the admin UI requests a presigned PUT URL from POST /api/knowledge-bases/files with { action: "getUploadUrl", kbId, filename, contentType }, then uploads directly from the browser to S3. The admin server is never in the file-transfer path.

After upload, the document list refreshes via listDocuments(kbId).

The Sync card shows Bedrock-side state:

  • Status — the KB’s own lifecycle state
  • Last Sync — relative timestamp
  • Doc Count — total document count
  • Sync Status — SUCCESS, FAILED, or similar (distinct from the KB status above)
  • Chunking strategy — how documents are split (fixed-size or semantic chunking)
  • Chunk size — tokens per chunk
  • Chunk overlap — percent overlap between chunks
  • Embedding model — the model used for vector embedding

A Sync Now button triggers SyncKnowledgeBaseMutation, which starts a Bedrock ingestion job. The UI polls KnowledgeBaseDetailQuery every 5 seconds while status === "syncing" to surface progress.
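The polling behavior can be sketched as a simple loop: refetch the detail query, wait five seconds, and repeat while the KB is still syncing. The interval and status values come from this page; the function names and injectable sleep are assumptions for illustration.

```typescript
type KbStatus = "creating" | "ready" | "syncing" | "error";

async function pollUntilSynced(
  refetch: () => Promise<KbStatus>,
  intervalMs = 5_000,
  sleep: (ms: number) => Promise<void> = (ms) => new Promise((r) => setTimeout(r, ms)),
): Promise<KbStatus> {
  let status = await refetch();
  while (status === "syncing") {
    await sleep(intervalMs); // wait 5s between detail-query refetches
    status = await refetch();
  }
  return status; // "ready" on success, "error" if the ingestion job failed
}
```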

If the KB has errored (status === "error"), the detail page shows the error message at the top. Common causes: embedding-model quota exhaustion, invalid document formats, S3 permissions drift.

Document file operations use a single RPC-style endpoint:

POST /api/knowledge-bases/files

With action dispatched through the request body:

| Action | Behavior |
| --- | --- |
| list | Returns { files: [{ name, size, lastModified }] } |
| getUploadUrl | Returns { uploadUrl } — a presigned S3 PUT URL valid for ~15 minutes |
| delete | Deletes a named file from the KB’s S3 bucket |

The upload URL response is short-lived and regenerated on each upload. The URL is scoped to a specific filename and bucket prefix, so a stolen URL can only overwrite the exact file it was issued for.
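The list and delete actions can be wrapped in small client helpers dispatching through the same RPC endpoint. The response shapes are from the action table above; the helper names and the delete body's filename field are assumptions (though the page does reference listDocuments(kbId) for refreshing the list after upload).

```typescript
interface KbFile {
  name: string;
  size: number;
  lastModified: string;
}

type FetchLike = (url: string, init?: RequestInit) => Promise<Response>;

// Shared dispatcher: every file operation POSTs an action to one endpoint.
async function filesRpc<T>(body: object, fetchImpl: FetchLike = fetch): Promise<T> {
  const res = await fetchImpl("/api/knowledge-bases/files", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify(body),
  });
  if (!res.ok) throw new Error(`files RPC failed: ${res.status}`);
  return (await res.json()) as T;
}

const listDocuments = (kbId: string, f?: FetchLike) =>
  filesRpc<{ files: KbFile[] }>({ action: "list", kbId }, f);

const deleteDocument = (kbId: string, filename: string, f?: FetchLike) =>
  filesRpc<unknown>({ action: "delete", kbId, filename }, f);
```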

| Query / Mutation | Purpose |
| --- | --- |
| KnowledgeBasesListQuery($tenantId) | List with summary fields |
| KnowledgeBaseDetailQuery($id) | Full detail with chunking, embedding, and sync metadata |
| UpdateKnowledgeBaseMutation | Edit KB metadata (name, description, chunking) |
| DeleteKnowledgeBaseMutation($id) | Hard delete |
| SyncKnowledgeBaseMutation($id) | Trigger a Bedrock ingestion job |

The detail query carries most of the interesting fields: status, errorMessage, lastSyncAt, lastSyncStatus, chunkingStrategy, chunkSizeTokens, chunkOverlapPercent, embeddingModel, documentCount.
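For reference, those fields written out as a TypeScript interface. The field names are verbatim from the list above; the union members and nullability are assumptions.

```typescript
interface KnowledgeBaseDetail {
  status: "creating" | "ready" | "syncing" | "error";
  errorMessage: string | null;
  lastSyncAt: string | null;       // timestamp, rendered as relative time in the UI
  lastSyncStatus: string | null;   // e.g. "SUCCESS" or "FAILED"
  chunkingStrategy: "fixed-size" | "semantic";
  chunkSizeTokens: number;
  chunkOverlapPercent: number;
  embeddingModel: string;
  documentCount: number;
}
```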

Knowledge bases are not automatically attached to every agent. The assignment happens on the agent’s Knowledge tab — see Agents → Knowledge. An operator attaches a KB to a specific agent, and the agent retrieves from that KB on every turn.

  • knowledge_bases — Aurora table with name, slug, status, description, error_message, chunking_strategy, chunk_size_tokens, chunk_overlap_percent, embedding_model, last_sync_at, last_sync_status, document_count
  • Source documents — S3 under s3://<bucket>/kbs/<kbId>/<filename>
  • Vector index — Aurora pgvector extension; the Bedrock ingestion job writes chunks and their embeddings directly into Aurora
  • Agent attachment — agent_knowledge_bases join table

Because the vector index lives in Aurora, a single Aurora snapshot restore brings back KBs, their vector indexes, and all tenant data in one shot. The only S3 state to worry about on disaster recovery is the source documents themselves, and those have S3 versioning enabled.

Creating and populating a KB:

  1. Click “Create KB” (or the empty-state CTA)
  2. Enter name, description, chunking strategy
  3. Save — the row is created with status creating and transitions to ready
  4. Open the detail page
  5. Upload documents one at a time or in a batch (each upload goes directly to S3 via presigned URL)
  6. Click Sync Now — the KB transitions to syncing
  7. The page polls every 5s until sync completes
  8. Attach the KB to one or more agents on their Knowledge tab
  9. Agents start retrieving from the new KB on their next turn
Replacing a stale document:

  1. Open the KB detail page
  2. Delete the stale version
  3. Upload the new version
  4. Click Sync Now
  5. Wait for sync to complete — the old chunks are removed and the new chunks are indexed
Changing chunking settings:

  1. Open the detail page
  2. Update chunkingStrategy, chunkSizeTokens, or chunkOverlapPercent (likely via an edit dialog; not deeply visible in route code)
  3. Save — the KB does not re-chunk existing documents automatically
  4. Trigger Sync Now to re-chunk with the new settings
  • No automatic sync on upload. Every upload requires a manual Sync Now click.
  • Polling is client-side. The admin UI polls KnowledgeBaseDetailQuery every 5 seconds while the sync is in flight. There’s no subscription for sync completion events.
  • Document upload is per-file. Bulk upload is sequential; the UI fires presigned-URL requests one at a time.
  • No in-UI document preview. Clicking a file does not open a preview; the admin UI only lists, uploads, and deletes.
  • Delete is permanent. No archival.