BizXEngine API

The memory and intelligence layer for AI agents. Store context, search semantically, and reason over your data — all from a single API.

API v1.0 · Base URL: https://api.bizxengine.com/v1

What is BizXEngine?

BizXEngine is a hosted memory infrastructure API. Rather than managing your own vector databases, embedding pipelines, and retrieval logic, you make a few HTTP calls and get back semantically-ranked, importance-weighted memories tailored to your agent's context.

Unlike raw RAG pipelines, BizXEngine memories carry metadata — importance scores, temporal decay, categories, and access frequency — so retrieval quality improves over time, not degrades.

Five intelligence primitives

Memory Storage

Persistent, structured memory store with TTL support and namespace isolation.

Semantic Search

Vector-based retrieval with multi-factor ranking — not just cosine similarity.

Reasoning

LLM-powered reasoning over retrieved memories. Stream results in real time.

Context Injection

Automatic deduplication and window management. Only inject what matters.

Base URL

HTTPS https://api.bizxengine.com/v1

All API requests must use HTTPS. HTTP is not supported. Requests to the wrong scheme will receive a redirect, not a 200 response.

Request & response format

All request bodies must be JSON (Content-Type: application/json). All responses are JSON. Dates are ISO 8601 UTC strings. Amounts and IDs are strings unless noted.

All API responses include a request_id field. Include this in any support requests — it dramatically speeds up debugging.

Next Quick Start →

Quick Start

From zero to your first memory stored and retrieved in under 5 minutes.

Step 1 — Get your API key

Create an account at app.bizxengine.com. Navigate to Settings → API Keys and create a new key. Keys are prefixed with bxe_.

Your API key is shown only once. Copy it immediately and store it in a secrets manager — never in source code or a public repository.

Step 2 — Store your first memory

bash

curl -X POST https://api.bizxengine.com/v1/memory/write \
  -H "Authorization: Bearer bxe_YOUR_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "Client XYZ prefers async comms and morning meetings",
    "workspace_id": "ws_abc123",
    "importance": 8
  }'

python

import bizxengine

client = bizxengine.Client(api_key="bxe_YOUR_KEY")

memory = client.memory.write(
    workspace_id="ws_abc123",
    text="Client XYZ prefers async comms and morning meetings",
    importance=8
)

print(memory.id)  # mem_7xKzP3...

typescript

import { BizXEngine } from "@bizxengine/sdk";

const client = new BizXEngine({ apiKey: "bxe_YOUR_KEY" });

const memory = await client.memory.write({
  workspaceId: "ws_abc123",
  text: "Client XYZ prefers async comms and morning meetings",
  importance: 8,
});

console.log(memory.id);

Step 3 — Search your memories

bash

curl -X POST https://api.bizxengine.com/v1/memory/search \
  -H "Authorization: Bearer bxe_YOUR_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "workspace_id": "ws_abc123",
    "query": "communication preferences",
    "top_k": 5
  }'

The response returns ranked memories with relevance scores:

json — response

{
  "memories": [
    {
      "id": "mem_7xKzP3",
      "text": "Client XYZ prefers async comms and morning meetings",
      "score": 0.94,
      "importance": 8,
      "created_at": "2025-03-14T09:41:00Z"
    }
  ],
  "request_id": "req_Kx8mN..."
}

The score field is a composite of semantic similarity (55%), temporal recency (20%), importance (15%), and access frequency (10%) — not raw cosine distance.

← Previous Introduction Next Authentication →

Authentication

BizXEngine uses bearer token authentication. Every request must include a valid API key in the Authorization header.

Authorization header

http

Authorization: Bearer bxe_YOUR_API_KEY

API key types

Type	Prefix	Scope
Secret key	bxe_sk_	Full API access. Use server-side only. Never expose in client code.
Publishable key	bxe_pk_	Read-only memory retrieval. Safe for client-side use.
Workspace key	bxe_ws_	Scoped to a single workspace. Useful for multi-tenant deployments.

Authentication errors

Code	Meaning
401	Missing or malformed Authorization header.
403	Valid key but insufficient permissions for this operation.
429	Rate limit exceeded. See Rate Limits.

← Previous Quick Start Next memory.write →

Workspaces

A Workspace is your Business Memory Brain — an isolated namespace for a single AI agent or product. Your plan limits apply per workspace.

What is a workspace?

Every memory you write belongs to a workspace. Workspaces are fully isolated — memories in ws_sales are never returned in queries against ws_support. You can create as many workspaces as you need under a single BizXEngine account.

Your subscription plan limits (writes, retrievals, reasoning calls) apply per workspace. If you need higher limits, upgrade the workspace's plan in the dashboard.

Creating a workspace

Workspaces are created in the dashboard at app.bizxengine.com. Each workspace gets a unique ID prefixed with ws_. Pass this ID in every API call.

Workspace object

Field	Type	Description
id	string	Unique identifier. Example: `ws_abc123`.
name	string	Human-readable label set in the dashboard.
plan	string	One of `free`, `pro`, `max`, `enterprise`.
created_at	string	ISO 8601 UTC timestamp.
memory_count	integer	Total memories stored in this workspace.

← PreviousAuthentication Nextmemory.write →

memory.write

Store a new memory entry in a workspace. The engine automatically embeds the text, assigns importance, and classifies temporal context.

POST/v1/memory/write

Request parameters

Parameter	Type	Required	Description
workspace_id	string	REQ	Target workspace. Must belong to your account.
text	string	REQ	The memory content. Plain text, max 4,000 chars.
importance	integer		Override importance score 1–10. If omitted, the engine scores it automatically.
expires_at	string		ISO 8601 UTC. Memory decays to near-zero relevance after this date.
tags	string[]		Optional string tags for filtering. Max 10 tags, 64 chars each.
meta	object		Arbitrary JSON metadata attached to the memory. Max 2KB.

Response

json

{
  "id": "mem_7xKzP3qR",
  "workspace_id": "ws_abc123",
  "text": "Client XYZ prefers async comms and morning meetings",
  "importance": 8,
  "tags": [],
  "temporal_type": "persistent",
  "created_at": "2025-03-14T09:41:00Z",
  "request_id": "req_Kx8mN7..."
}

Deduplication

Before storing, the engine checks for semantic duplicates. If a memory with cosine similarity ≥ 0.97 already exists, the existing record is returned with status: "duplicate" and a duplicate_of field pointing to the original ID.

← PreviousWorkspaces Nextmemory.search →

memory.search

Retrieve semantically relevant memories from a workspace, ranked by a multi-factor scoring model.

POST/v1/memory/search

Request parameters

Parameter	Type	Required	Description
workspace_id	string	REQ	Workspace to search within.
query	string	REQ	Natural language query. Max 1,000 chars.
top_k	integer		Number of results to return. Default `5`, max `50`.
min_score	float		Minimum composite score threshold (0–1). Default `0.0`.
tags	string[]		Filter to memories that have all of the specified tags.
include_decayed	boolean		Include expired temporal memories. Default `false`.

Scoring model

The composite score S is computed as a weighted sum:

formula

S = 0.55 × similarity
  + 0.20 × recency
  + 0.15 × importance
  + 0.10 × frequency
  × decay_multiplier

The decay_multiplier reduces scores for old low-importance memories and zeroes out expired temporal ones. High-importance memories (≥9) decay very slowly over 1,000 days.

← Previousmemory.write Nextreason →

reason

Run an LLM-powered reasoning pass over retrieved memories. Supports streaming via Server-Sent Events.

POST/v1/reason

Request parameters

Parameter	Type	Required	Description
workspace_id	string	REQ	Workspace containing context memories.
query	string	REQ	The question or task for the reasoning engine.
stream	boolean		Stream tokens via SSE. Default `false`.
top_k	integer		Memories to inject into context. Default `8`.
model	string		Reasoning model. Default `bxe-reason-1`.

Streaming

When stream: true, the response is an SSE stream. Each event is a JSON delta:

sse stream

event: delta
data: {"token":"Based"}

data: {"token":" on"}

data: {"token":" the"}

event: done
data: {"usage":{"prompt_tokens":312,"completion_tokens":87}}

Each reason call counts as one reasoning call against your plan's monthly limit, regardless of token count.

← Previousmemory.search NextErrors & Codes →

Errors & Status Codes

BizXEngine uses standard HTTP status codes. Every error response includes a machine-readable code and a human-readable message.

Error format

json

{
  "error": {
    "code": "memory_not_found",
    "message": "No memory with ID mem_xyz in workspace ws_abc123.",
    "request_id": "req_Kx8mN7..."
  }
}

Status codes

HTTP	Code	Description
400	invalid_request	Missing required parameter or invalid value.
401	unauthorized	API key missing or malformed.
403	forbidden	Key valid but lacks permission for this resource.
404	not_found	Resource does not exist.
409	conflict	Duplicate memory detected (see dedup).
422	unprocessable	Request well-formed but semantically invalid.
429	rate_limited	Too many requests. Check `Retry-After` header.
500	internal_error	Something went wrong on our end. Contact support with the `request_id`.

← Previousreason NextRate Limits →

Rate Limits

Rate limits are enforced per workspace, per minute. Monthly operation limits are enforced per billing cycle.

Limits by plan

Plan	Req / min	Writes / mo	Retrievals / mo	Reasoning / mo
free	10	1,000	5,000	100
pro	60	25,000	100,000	2,000
max	300	150,000	500,000	15,000
enterprise	custom	unlimited	unlimited	unlimited

Rate limit headers

Every response includes headers so you can track your current usage:

http response headers

X-RateLimit-Limit: 60
X-RateLimit-Remaining: 57
X-RateLimit-Reset: 1741954860
X-Monthly-Writes-Remaining: 24821

When you receive a 429, read the Retry-After header for the exact number of seconds to wait before retrying.

← PreviousErrors & Codes NextChangelog →

Changelog

All notable changes to the BizXEngine API, SDKs, and platform.

v1.0.0 LATEST March 2025

Initial public release of the BizXEngine Memory API.
Five core primitives: memory.write, memory.search, memory.get, memory.delete, reason.
Temporal decay engine with importance-weighted scoring.
Python and TypeScript SDKs.
Workspace isolation and per-workspace billing.

v0.9.0 BETA January 2025

Private beta. Invite-only access.
Core memory engine implemented with pgvector backend.
LLM-powered deduplication and importance scoring.

← PreviousRate Limits

memory.get

Retrieve a single memory by ID.

GET/v1/memory/{id}

Returns the full memory object including text, importance, temporal metadata, and access statistics.

Fetching a memory increments its reference_count, which increases its frequency score in future searches.

← Previousmemory.search Nextmemory.delete →

memory.delete

Permanently delete a memory from a workspace.

DELETE/v1/memory/{id}

Deletion is permanent. The memory and its embedding are removed immediately and cannot be recovered.

← Previousmemory.get Nextmemory.list →

memory.list

Paginate through all memories in a workspace, sorted by creation date.

GET/v1/memory?workspace_id={id}

Supports limit (max 100) and cursor pagination. Returns a next_cursor field if more results exist.

← Previousmemory.delete Nextreason →

context.inject

Returns a ready-to-inject context string from the top-k memories, with deduplication and token budgeting applied.

POST/v1/context/inject

Pass the result directly as the system or context field in your LLM call. BizXEngine handles ranking, window management, and deduplication — you just pass a token budget.

← Previousreason NextMemory Decay →

Memory Decay

BizXEngine automatically deprioritizes stale memories. The decay model ensures long-running agents don't drown in outdated context.

Decay model

Decay is applied as a multiplier (0.05–1.0) on the composite retrieval score. The multiplier depends on memory type, importance, and age.

Condition	Decay multiplier
Expired temporal (past `expires_at`)	0.05 (near-invisible)
Importance ≥ 9, any age	max(0.9, 1 − age/1000)
Importance ≥ 7	max(0.75, 1 − age/500)
All others	max(0.3, 1 − age/180)

Set importance: 9 or 10 for facts that should never fade — contract terms, user preferences, critical configs.

← Previouscontext.inject NextPython SDK →

Python SDK

The official Python client for BizXEngine. Supports Python 3.9+.

Installation

bash

pip install bizxengine

Initializing the client

python

import bizxengine

# Reads BXE_API_KEY from environment if not provided
client = bizxengine.Client(api_key="bxe_YOUR_KEY")

# Write a memory
m = client.memory.write(workspace_id="ws_abc", text="...")

# Search
results = client.memory.search(workspace_id="ws_abc", query="meeting prefs")

← PreviousMemory Decay NextTypeScript SDK →

TypeScript SDK

The official TypeScript / JavaScript client. Works in Node.js 18+ and modern browsers.

Installation

bash

npm install @bizxengine/sdk

Initializing

typescript

import { BizXEngine } from "@bizxengine/sdk";

const client = new BizXEngine({ apiKey: "bxe_YOUR_KEY" });

const memory = await client.memory.write({
  workspaceId: "ws_abc",
  text: "...",
});

← PreviousPython SDK NextErrors & Codes →

Webhooks BETA

Receive real-time HTTP callbacks when events occur in your workspace — memory written, deleted, or when you approach plan limits.

Events

Event	Description
memory.written	A new memory was successfully stored.
memory.deleted	A memory was permanently removed.
plan.limit_warning	Workspace has used 80% of a monthly limit.
plan.limit_reached	A monthly limit has been exhausted.

Webhooks are available on Max and Enterprise plans only.

← PreviousRate Limits NextChangelog →