The Astrophysical Journal accepted our AI's FRB discovery after three rounds of peer review, but the AAS editorial office then halted the paper over an AI-disclosure review, not the science.
Read moreDropstone Pro 1.5
Frontier agentic coding within 0.2 points of Claude Opus 4.7 on SWE-bench Verified. Built on DeepSeek V4 Pro, hosted in the US.
Announcements
Why Blankline is working towards a 10-million agent architecture grounded in directed learning and safety.
Read moreDecoupling reasoning depth from context length via Recursive Swarm Architecture and heterogeneous inference routing.
Read moreBy decoupling probabilistic reasoning from deterministic state, D3 eliminates context saturation enabling reliable, long-horizon autonomous engineering agents.
Read moreMeasured head-to-head.
Standard public coding benchmarks, latest published runs.

Everything Pro ships with.
Dropstone Pro is the default workhorse for real engineering. It scores 80.6% on SWE-bench Verified, 0.2 points behind Claude Opus 4.7, leads Opus on LiveCodeBench and Terminal-Bench 2.0, and serves at a fraction of frontier-closed-model cost. Same approval-gated CLI, same zero-retention US inference, dramatically more turns per dollar.
Full-stack feature work
Plan a feature, edit across the repo, run the test suite, fix the failures. One approval gate per tool call, frontier-grade reasoning behind it.
Agentic debugging
Hand Pro a stack trace and let it grep, edit, and re-run until the test goes green. Designed for the SWE-bench-shaped tasks that close real GitHub issues.
Code review at scale
Run Pro across a PR diff to flag security, performance, and style issues, with the same OWASP coverage as Dropstone's safety analyzer.
The security boundary is the approval gate.
Every Dropstone request is treated as if the model could be adversarial. The CLI requires explicit user approval before any action that writes to disk, runs a shell command, or fetches a URL. No model output is ever auto-executed.
US-hosted inference, zero retention.
Pro runs on SOC 2-certified, US-based inference providers. Prompts and completions are not retained, not used for training, and not shared with the model's original operator.
Honest about what we cannot prove.
Pro is built on DeepSeek V4 Pro, an open-weight foundation model. Goldwasser et al. (2022) proved no party can prove a closed foundation model is free of embedded behaviors, including Anthropic for Claude and OpenAI for GPT. We say this out loud. The runtime is why model origin does not matter for your code.
Three tiers. One CLI.
Same approval gate, same US-hosted inference, same zero-retention guarantee across all three. Pick the smallest model that meets the task.
Dropstone Fast 1.5
Sub-second agentic coding for edits, refactors, and high-throughput inline completion. Built on DeepSeek V4 Flash, hosted in the US.
- Inline completion
- File-scoped edits
- Test scaffolding
- Approval gate on every tool call
- US-hosted inference
- Zero retention
- Monthly model refresh
Dropstone Pro 1.5
Frontier agentic coding within 0.2 points of Claude Opus 4.7 on SWE-bench Verified. Built on DeepSeek V4 Pro, hosted in the US.
- Full-stack feature work
- Agentic debugging
- Code review at scale
- Approval gate on every tool call
- US-hosted inference
- Zero retention
- Monthly model refresh
Dropstone Heavy 1.5
Long-horizon agentic coding with 300-sub-agent swarms and 4,000 coordinated steps. Built on Kimi K2.6, hosted in the US.
- Multi-day refactors
- Agent swarms
- Frontier research workflows
- Approval gate on every tool call
- US-hosted inference
- Zero retention
- Monthly model refresh
Fast is included on every plan. Pro and Heavy are unlocked on the Pro plan ($15/mo) and Max plan ($75/mo).
Questions you'd ask in a security review.
When should I use Dropstone Pro 1.5?
Is my code sent to a Chinese model provider?
What changes when Dropstone Pro 1.5 gets refreshed?
Do you offer enterprise deployment?
Ship with Dropstone Pro 1.5.
Install the CLI, authenticate, and start running approval-gated agentic workflows. No credit card to start.