Services
Custom engagements & applied research
Beyond the off-the-shelf research agent: we work directly with teams that need more, run applied research on the agent itself, and have more services on the way.
Individual subscriptions
If you're working on your own — a PhD student writing a dissertation, a postdoc triaging the literature, a PI prepping a grant — the off-the-shelf research agent is available as a single-user subscription. No procurement, no annual minimums, no custom-engagement scoping call required: pick a plan that matches the volume of queries you expect to run and start using the agent the same day.
Plus is $50/month for 15 queries. Pro is $100/month for 35 queries (most popular). Max is $300/month for 120 queries. There's also Try-it: a one-time $10 purchase for 5 queries with no subscription at all, in case you'd rather kick the tires first. Academic researchers get 20% off any paid tier automatically when signing up with an academic email.
Custom AI engagements
The standard research agent solves one shape of problem well — a question, the public literature, a cited Markdown report back. Plenty of useful work doesn't fit that shape: your data lives behind a license the agent can't touch, your domain needs careful tuning, your pipeline expects a different output, or the workflow is more than a single query at a time.
Engagements we have run or are scoping conversations about:
- Connecting the agent to proprietary databases, internal corpora, or paywalled sources your group already pays for.
- Tuning agent prompts, retrieval, and tool sets for a specific scientific domain or therapeutic area.
- Building bespoke evaluation harnesses so a team can measure agent quality on the questions that actually matter to their pipeline.
- Designing institutional workflows — grant drafts, tumor boards, regulatory submissions, lab-notebook integration — where the agent is one component of a larger system.
We don't pretend to do everything. The strongest engagements are ones where the underlying problem is a research-or-literature problem we have built deep machinery for. If that sounds like the problem you're working on, write us at info@sparkit.science with a paragraph about it. We respond with whether it's a fit, what a scope might look like, and a rough timeline.
Applied research
SPARKIT is built by a team that also runs applied research on agents, evaluation methodology, and AI for science. The HLE-Gold and GAIA numbers on the home page came out of that work; future evaluations and methodology notes land on the public blog rather than stay internal.
Current and recent threads:
- Benchmarking the agent on Humanity's Last Exam (gold subset) and GAIA against frontier models and search APIs.
- Evaluation design for scientific reasoning where ground truth isn't a single number.
- Agent safety work — refusal training, output screening, citation-fidelity audits.
- Public write-ups of real-world workflows (CRISPR-screen triage, hereditary-cancer panels) on the blog.
If you're working on related research and want to collaborate, or want to read the methodology behind a specific evaluation, reach out.
What's coming
We have a small backlog of services we're scoping for specific corners of the research-tooling space. If your use case doesn't fit cleanly into the off-the-shelf agent or a custom engagement as described above, we still want to hear about it — it may shape what we build next. Write us with a paragraph about the problem.
Get in touch
For custom engagements, research collaborations, or anything else: info@sparkit.science.