
Senior AI Researcher

Remote · USA · Full-time

Assail | AI Engineering | Reports to VP of AI Engineering | Remote-friendly, Boston HQ

About Assail

Assail builds autonomous offensive security. Our platform, Ares, finds vulnerabilities in production systems by reasoning about them the way an experienced attacker would: chaining flaws across APIs, web applications, and mobile surfaces to surface the exploits that scanners miss and human testers run out of time to find.

We train our own models. Dagger is our 14B-parameter offensive security model, fine-tuned for vulnerability discovery and exploit reasoning. Javelin is our co-evolutionary self-training architecture, where attacker and defender models train against each other to push capability further than either could reach alone. The research surface is wide open, the domain is consequential, and the work ships into a platform that's actively used against hardened enterprise targets.

The Role

We're hiring our first dedicated AI Researcher to advance the core models powering Ares. You'll work alongside our VP of AI Engineering and a small AI engineering team, with direct collaboration with our CEO: a researcher and practitioner with 26 years of offensive security experience, contributions to the OWASP API Security Top 10, and a permanent exhibit at The Mob Museum.

This is a research role, not an applied ML role. You'll own original research on offensive security agents: how they reason, plan, use tools, and operate autonomously over long horizons. You'll design experiments end-to-end, build the evaluation infrastructure the field doesn't yet have, and translate research wins into capability that ships.

The feedback loop is fast and adversarial. Research that proves out goes into production. Research that doesn't gets killed quickly so the next bet can start.

What You'll Do

- Drive original research on offensive security agents: reasoning, planning, tool use, and autonomous long-horizon operation
- Advance Dagger's post-training pipeline: supervised fine-tuning, RL from verifier signals, LoRA adaptation, and evaluation against adversarial benchmarks
- Extend Javelin's co-evolutionary self-training architecture: curriculum design, self-play dynamics, and reward modeling for security-specific outcomes
- Design and execute experiments end-to-end, from hypothesis through writeup
- Build internal evaluation harnesses that measure capability rigorously, where no public benchmark exists
- Translate research into production handoffs to AI Engineering: model cards, deployment notes, and known failure modes
- Contribute to Assail's external research voice through papers, talks, responsible disclosures, and technical writing
- Collaborate with engineering teammates on research methodology and experimental design

What We're Looking For

You don't need every item on this list. We care more about depth where you have it than breadth where you don't.

Core experience that matters most:

- Original ML research output: published papers, widely cited preprints, significant open-source releases, or shipped research that materially advanced a production system
- Hands-on post-training experience with language models at the 7B+ parameter scale, with end-to-end ownership of a pipeline including data, training, and evaluation
- Direct work with at least one of: RL from verifier or reward signals, preference optimization (DPO/IPO/KTO), or supervised fine-tuning with synthetic data pipelines
- Experience with agentic LLM systems: tool use, multi-step reasoning, planning, or long-horizon execution
- Ability to design evaluation that measures real capability and avoids contamination or specification gaming
- Strong Python and PyTorch, with experience in distributed training at multi-GPU scale
- Clear technical writing: research memos, experiment writeups, papers, or equivalent

Helpful but learnable here:

- Working knowledge of offensive security fundamentals (we'll teach you the rest if you bring strong ML depth)
- Prior work on code-generating or code-reasoning models
- Experience with sparse, delayed, or expensive reward signals in RL
- Research on robustness, adversarial ML, or red-teaming of language models
- Familiarity with long-horizon agent benchmarks (SWE-bench, Cybench, WebArena, or similar)

Things we deliberately don't require:

- A PhD. Track record matters more than the credential. If your work demonstrates the capability, the degree is secondary.
- A security background. Strong ML researchers can develop security depth here, and we'll support you in doing it.
- A specific number of years. Senior is a function of judgment and output, not a count.

What This Role Will Teach You

- How to train and post-train capable models in a narrow, high-stakes domain
- How to design evaluation that holds up to scrutiny when no benchmark exists yet
- How agentic systems behave under adversarial conditions, including failure modes that don't appear in benign settings
- The full offensive security stack (API, web, and mobile) at a depth most ML researchers never reach
- How to make publication and disclosure decisions for dual-use research
- How research moves from hypothesis to production in a small team where the handoff is measured in days

What We Offer

- Competitive base salary and meaningful early-stage equity
- Comprehensive health and dental coverage
- Unlimited paid time off, including parental leave
- Conference, publication, and continued learning budget (we want you engaged with the research community)
- The chance to work on a problem that matters, with people who care about doing it well
