Dropstone

Dropstone Heavy 1.5

Long-horizon agentic coding with 300-sub-agent swarms and 4,000 coordinated steps. Built on Kimi K2.6, hosted in the US.

Built on
Moonshot Kimi K2.6
Hosted in
US (SOC 2)
Approval gate
Every tool call
Refresh cadence
Monthly
Announcements

Announcements

NewAn AI did the astrophysics. The paper got halted.Apr 18, 2026

The Astrophysical Journal accepted our AI's FRB discovery after three rounds of peer review, but the AAS editorial office then halted the paper over an AI-disclosure review, not the science.

Read more
The Path to Safe AGI: Scaling Automated ResearchDec 20, 2025

Why Blankline is working towards a 10-million agent architecture grounded in directed learning and safety.

Read more
Horizon Mode: Breaking the Linearity Barrier in Autonomous EngineeringDec 19, 2025

Decoupling reasoning depth from context length via Recursive Swarm Architecture and heterogeneous inference routing.

Read more
Introducing the Dropstone D3 EngineDec 19, 2025

By decoupling probabilistic reasoning from deterministic state, D3 eliminates context saturation enabling reliable, long-horizon autonomous engineering agents.

Read more
Benchmarks

Measured head-to-head.

Standard public coding benchmarks, latest published runs.

Dropstone Heavy 1.5 benchmarks
Overview

Everything Heavy ships with.

Dropstone Heavy is the tier for the work that takes a whole afternoon and a whole repository. Built on Moonshot Kimi K2.6, scoring 58.6 on SWE-Bench Pro (vs 53.4 for Claude Opus 4.6) and 54.0 on Humanity's Last Exam with tools, leading every published model. Scales horizontally to 300 sub-agents executing 4,000 coordinated steps inside the same approval-gated CLI.

01

Multi-day refactors

Migrate a monorepo from one framework to another. Heavy plans the work, decomposes it across hundreds of sub-agents, and reports back when the build is green.

02

Agent swarms

Up to 300 parallel sub-agents working on different parts of a codebase at once, coordinated through the same approval gate that protects every other Dropstone session.

03

Frontier research workflows

Tool-augmented reasoning over scientific datasets. The same workflow that drives Blankline's FRB and wormhole analysis pipelines.

Trust & safety

Security comes from the runtime, not the weights.

Read the full system card

The security boundary is the approval gate.

Every Dropstone request is treated as if the model could be adversarial. The CLI requires explicit user approval before any action that writes to disk, runs a shell command, or fetches a URL. No model output is ever auto-executed.

US-hosted inference, zero retention.

Heavy runs on SOC 2-certified, US-based inference providers. Prompts and completions are not retained, not used for training, and not shared with the model's original operator.

Honest about what we cannot prove.

Heavy is built on Moonshot Kimi K2.6, an open-weight foundation model. Goldwasser et al. (2022) proved no party can prove a closed foundation model is free of embedded behaviors, including Anthropic for Claude and OpenAI for GPT. We say this out loud. The runtime is why model origin does not matter for your code.

Pick your tier

Three tiers. One CLI.

Same approval gate, same US-hosted inference, same zero-retention guarantee across all three. Pick the smallest model that meets the task.

Dropstone Fast

Dropstone Fast 1.5

Sub-second agentic coding for edits, refactors, and high-throughput inline completion. Built on DeepSeek V4 Flash, hosted in the US.

Best for
  • Inline completion
  • File-scoped edits
  • Test scaffolding
Security
  • Approval gate on every tool call
  • US-hosted inference
  • Zero retention
  • Monthly model refresh
Dropstone Pro

Dropstone Pro 1.5

Frontier agentic coding within 0.2 points of Claude Opus 4.7 on SWE-bench Verified. Built on DeepSeek V4 Pro, hosted in the US.

Best for
  • Full-stack feature work
  • Agentic debugging
  • Code review at scale
Security
  • Approval gate on every tool call
  • US-hosted inference
  • Zero retention
  • Monthly model refresh
You are here
Dropstone Heavy

Dropstone Heavy 1.5

Long-horizon agentic coding with 300-sub-agent swarms and 4,000 coordinated steps. Built on Kimi K2.6, hosted in the US.

Best for
  • Multi-day refactors
  • Agent swarms
  • Frontier research workflows
Security
  • Approval gate on every tool call
  • US-hosted inference
  • Zero retention
  • Monthly model refresh

Fast is included on every plan. Pro and Heavy are unlocked on the Pro plan ($15/mo) and Max plan ($75/mo).

FAQ

Questions you'd ask in a security review.

When should I use Dropstone Heavy 1.5?
Dropstone Heavy is the tier for the work that takes a whole afternoon and a whole repository. Built on Moonshot Kimi K2.6, scoring 58.6 on SWE-Bench Pro (vs 53.4 for Claude Opus 4.6) and 54.0 on Humanity's Last Exam with tools, leading every published model. Scales horizontally to 300 sub-agents executing 4,000 coordinated steps inside the same approval-gated CLI.
Is my code sent to a Chinese model provider?
No. Inference runs on SOC 2-certified, US-based providers. The model weights are open-source and loaded into those providers' US data centers. Your prompts never touch a foreign network.
What changes when Dropstone Heavy 1.5 gets refreshed?
The version number and benchmarks. The CLI surface, the pricing structure, the approval-gate behavior, and the security model do not change. Existing scripts and CI pipelines continue working without modification.
Do you offer enterprise deployment?
Yes. Dropstone Enterprise extends the same audited-tier platform with VPC, on-premises, and air-gapped deployments, SSO, audit logs, and custom SLAs. Pricing is seat plus usage at API rates, on annual commitments. Read the Enterprise plan overview or contact enterprise@blankline.org.

Ship with Dropstone Heavy 1.5.

Install the CLI, authenticate, and start running approval-gated agentic workflows. No credit card to start.