Model v4 Now Available
Extract entities, classify intent, and generate schema-constrained outputs from unstructured text. 200K context window. Sub-200ms P95 latency.
Response — 143ms
[N] developers ship with Nexus API
Send a request, get structured data back. No post-processing needed.
What the model does well, and what it was designed for.
Process entire codebases, legal documents, or research papers in a single request. No chunking, no lost context.
Input and output windows are independently configurable per request.
Define a JSON Schema and get guaranteed-valid responses. No post-processing, no regex parsing.
Works with nested objects, arrays, enums, and recursive schemas.
1{
2 "sentiment": "positive",
3 "confidence": 0.94,
4 "entities": [
5 {
6 "text": "Nexus AI",
7 "type": "ORG"
8 }
9 ]
10}
SOC 2 Type II certified. Data encrypted at rest and in transit. Zero data retention on API requests by default.
Custom data residency available for regulated industries.
Define tools as JSON Schema, and the model decides when and how to invoke them. Supports parallel and sequential tool use.
Every model undergoes adversarial evaluation, bias benchmarking, and alignment tuning before release. Published model cards include failure modes and known limitations.
External adversarial testing by independent security researchers before each model release.
Continuous monitoring across demographic axes with published evaluation results and methodology.
Multi-layer output filtering with configurable sensitivity for enterprise needs.
Pay only for what you use. No seat licenses, no minimums.
| Model | Input / 1M tokens | Output / 1M tokens | Context |
|---|---|---|---|
| nexus-v4 | $[N] | $[N] | 200K |
| nexus-v4-mini | $[N] | $[N] | 128K |
| nexus-embed | $[N] | — | 8K |
| nexus-vision | $[N] | $[N] | 128K + images |
Volume discounts available above $[N]/month. Contact sales
Get your API key in seconds. First $[N] of usage is free.