SourceKit user reference guide
Shareable product description

SourceKit finds engineers by what they've built.

SourceKit is an evidence-first sourcing system for technical hiring. You start with a role or JD; SourceKit turns it into a discovery plan, verifies signals across GitHub and market context, and returns a ranked pipeline ready for outreach.

Artifact-first discovery · EEA criteria support · persistent Websets · parallel company graph · ranked candidate pipeline
Run your first search
Integrates with: Exa · Claude · Harmonic · Parallel · GitHub

Quick start paths

Pick one input path, get ranked candidates, and take one clear first action.

Role + Company (fastest)
  • Input: `Staff Backend Engineer` + target company context.
  • Expected output: repo set, shortlist of contributor-led candidates, score distribution.
  • First action: edit top repos before running outreach.

Full JD (most complete)
  • Input: paste the full JD with stack, seniority, and constraints.
  • Expected output: criteria draft + market-adjacent discovery + stronger filtering.
  • First action: convert top criteria to binary EEA checks.

Job URL (operational)
  • Input: paste a Lever/Greenhouse/Ashby link.
  • Expected output: auto-parsed scope, ranked candidates, Webset-ready criteria.
  • First action: promote durable searches into a weekly Webset.

Suggested EEA signals are added for every search

If strategy output has no usable EEA criteria, SourceKit seeds 3-5 draft checks you can edit before creating a Webset. This keeps every search evidence-first by default.

Founding ML Engineer

ML + infra
  • Contribution ownership in core repos (maintainer/reviewer/top contributor).
  • Production ML system ownership with reliability/scale evidence.
  • Model-infrastructure depth across training/serving/tooling surfaces.
  • Public technical artifact tied to shipped ML work.

Staff Backend (Distributed)

Systems
  • Contribution ownership in distributed systems repos.
  • Reliability/latency/throughput improvement ownership.
  • Maintainer/reviewer/RFC-author behavior on infra projects.
  • Recent shipped impact in the last 12 months.

Security Engineer

Security
  • Security remediation ownership (CVE/advisory/critical patch).
  • Public security artifact (talk, writeup, audit, or analysis).
  • Contribution ownership in security-focused repositories.
  • Recent shipped security impact with public proof.

Staff Frontend Platform

Frontend
  • Framework/tooling contribution beyond app-level changes.
  • Platform performance or build-system ownership evidence.
  • Contribution ownership in shared frontend surfaces.
  • Public artifact showing cross-team DX impact.

Value: Faster setup

Operators start from concrete criteria in every run instead of writing EEA checks from scratch.

Value: Lower noise

Binary evidence criteria reduce false positives before effort accumulates in scoring and outreach.

Value: Better spend

Verification-first criteria keep enrichment and outreach spend focused on candidates with proof.

Value proposition

Why teams use SourceKit instead of title search and static LinkedIn filters.

Hidden gem rate: ~40%

Top candidates often have limited profile visibility. Artifact-led discovery surfaces strong builders before they become heavily recruited.

Pipeline behavior: always on

Websets convert one strong search into a persistent, auto-updating candidate stream with verified entrants.

Screening logic: binary proof

Criteria can be framed as pass/fail against public evidence, reducing soft interpretation and resume-style noise.
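
To make the pass/fail idea concrete, here is a minimal sketch of binary evidence checks expressed as named predicates. The evidence fields and check names are illustrative assumptions, not SourceKit's actual schema.

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class Evidence:
    # Hypothetical public-evidence record; fields are assumptions.
    contributor_rank: int | None  # rank in a target repo, None if absent
    shipped_prod_ml: bool         # production ML system ownership
    public_artifacts: int         # talks, writeups, papers

# Each EEA check is a named pass/fail predicate over public evidence.
CHECKS: dict[str, Callable[[Evidence], bool]] = {
    "top10_contributor": lambda e: e.contributor_rank is not None and e.contributor_rank <= 10,
    "production_ml_ownership": lambda e: e.shipped_prod_ml,
    "public_technical_artifact": lambda e: e.public_artifacts >= 1,
}

def passes(evidence: Evidence, required: int = 2) -> bool:
    """Admit only when at least `required` checks pass -- no soft scoring."""
    return sum(check(evidence) for check in CHECKS.values()) >= required
```

Because every check is a hard boolean, a candidate either clears the admission bar or does not; there is no "strong-ish" middle ground to argue over.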

End-to-end walkthrough

The operating flow from intake to pipeline. Each stage covers what to do, what the system does, and what success looks like.

Operator action

Start from role input, full JD, or job URL.
  • Input: role statement, full JD, or job URL.
  • Action: keep stack and constraints explicit.
  • Output: cleaner repo targeting at the planning step.

Details: specificity at intake is the highest-leverage quality control for the entire run.

System output: normalized role context for planning APIs.

Success signal: the role statement is precise enough to eliminate generic repo suggestions.
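
As a sketch of what a "normalized role context" might look like, the structure below captures role, stack, seniority, and constraints in one object. The field names are assumptions for illustration, not SourceKit's actual planning-API payload.

```python
# Illustrative normalized role context; keys are assumptions.
role_context = {
    "role": "Staff Backend Engineer",
    "focus": "distributed systems",
    "stack": ["go", "python", "grpc"],
    "seniority": "staff",
    "constraints": {"recency_months": 12},
    "must_have_evidence": [
        "maintainer_or_reviewer_signal",
        "production_scale_system",
    ],
}
```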

Feature reference

Core capabilities and where they create leverage in the workflow.

Role to Search Strategy

Planning
  • Accepts role text, full JD, or job URL.
  • Generates repo targets, company targets, and criteria draft.
  • Supports manual refinement before execution.

Multi-API Discovery Layer

Discovery
  • Combines Exa, Parallel, and GitHub signals.
  • Expands discovery through technical and market adjacency.
  • Returns artifact-backed candidate context.

Builder Score Evaluation

Scoring
  • Scores candidates 0-100 across contribution dimensions.
  • Weights recency, commit velocity, stack match, and impact.
  • Produces ranked shortlist with evidence markers.
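
A minimal sketch of how a weighted 0-100 score over recency, commit velocity, stack match, and impact could be computed. The weights here are invented for illustration; SourceKit's actual scoring model is not specified in this guide.

```python
# Assumed weights for illustration only.
WEIGHTS = {"recency": 0.25, "commit_velocity": 0.25, "stack_match": 0.30, "impact": 0.20}

def builder_score(signals: dict[str, float]) -> float:
    """Each signal is pre-normalized to [0, 1]; returns a 0-100 score."""
    return round(100 * sum(WEIGHTS[k] * signals.get(k, 0.0) for k in WEIGHTS), 1)

# Example: strong stack match, solid velocity, moderate recency.
print(builder_score({"recency": 0.6, "commit_velocity": 0.8,
                     "stack_match": 0.9, "impact": 0.7}))  # 76.0
```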

Pipeline Workflow

Execution
  • Stage-based candidate movement for sourcing operations.
  • Supports compare, summarize, and batch actions.
  • Designed for recruiter throughput after technical filtering.

Exa Websets

Persistent search
  • Creates auto-updating candidate collections from criteria.
  • Appends new verified matches on schedule.
  • Supports enrich, monitor, and override workflows.

Exports and Integrations

Output
  • Exports via API and CSV for downstream workflows.
  • Feeds Clay/Parallel-style sequencing and enrichment flows.
  • Keeps artifact-level signal attached to candidate records.
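
For the CSV path, a downstream filter can be as simple as the sketch below. The column names are assumptions about an export's shape, not a documented schema.

```python
import csv

# Keep only candidates above the score bar; column names are assumed.
with open("sourcekit_export.csv", newline="") as f:
    for row in csv.DictReader(f):
        if float(row["builder_score"]) >= 82:
            print(row["name"], row["github_url"])
```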

How to get maximum value

Operational habits that improve quality and reduce wasted effort.

Do this
  • Use precise role framing. Specific role language drives better repo targeting and less scoring noise.
  • Edit the repo list every run. Repo quality is the biggest upstream lever for candidate quality.
  • Define 3-5 verifiable EEA markers. Use objective proof signals before enrichment and outreach.
  • Convert durable roles to Websets. Let the best searches compound through weekly or daily monitoring.

Avoid this
  • Generic criteria text. Criteria like "strong engineer" will inflate false positives.
  • Title-only filtering. Front-loading title filters reintroduces profile bias and misses builders.
  • Enriching before verification. Verify first, enrich survivors second to protect spend and quality.
  • Single-market assumptions. Use adjacency signals to reach less saturated ecosystems.

Sample EEA criteria + Webset guidance

Practical examples you can reuse. Keep criteria verifiable from public artifacts and keep Websets strict at admission.

Sample EEA criteria by role

Founding ML Engineer

  • Top-10 contributor to frontier ML infra repo.
  • Shipped production inference or training system.
  • Publication or conference artifact in ML systems.

Staff Backend (Distributed)

  • 50+ commits to distributed systems project.
  • Maintainer/reviewer or RFC ownership signal.
  • Evidence of production-scale reliability work.

Security Engineer

  • CVE discovery, advisory, or remediation ownership.
  • Security commits to relevant OSS repos.
  • Public proof: talks, writeups, or audits.

Staff Frontend Platform

  • Core contributions to framework/tooling ecosystem.
  • Perf or build-system optimization ownership.
  • Cross-team DX/platform impact evidence.
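
Several of these criteria reduce to contributor rank, which is checkable against GitHub's public REST API: the /contributors endpoint lists a repo's contributors ordered by commit count, so list position works as a rank proxy. A minimal sketch follows; unauthenticated calls are rate-limited, and the candidate login is hypothetical.

```python
import requests

def contributor_rank(owner: str, repo: str, login: str, limit: int = 50) -> int | None:
    """Return the login's rank in the repo's contributor list, or None if absent."""
    url = f"https://api.github.com/repos/{owner}/{repo}/contributors"
    resp = requests.get(url, params={"per_page": limit})
    resp.raise_for_status()
    for rank, contributor in enumerate(resp.json(), start=1):
        if contributor["login"] == login:
            return rank
    return None

# Binary check: top-10 contributor to a frontier ML infra repo.
rank = contributor_rank("vllm-project", "vllm", "some-candidate")
print("top10_contributor:", rank is not None and rank <= 10)
```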

Webset operating playbook

1) Set strict admission criteria

Use 3-5 binary checks. Avoid soft language like "strong" or "solid."

2) Verify first, enrich second

Run criteria filters before adding contact or publication enrichments.

3) Use weekly cadence by default

Daily is best for urgent hiring; weekly is cleaner for most durable roles.

4) Replace stale criteria quickly

If false positives rise, tighten criteria before scaling outreach volume.

5) Track conversion by criteria set

Keep the criteria version with each cohort to learn which definition works.
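
One lightweight way to do this, sketched with invented fields: stamp each admitted candidate with the criteria version that admitted them, then compare reply rates per version.

```python
from collections import defaultdict

# Hypothetical cohort records; field names are illustrative.
cohorts = [
    {"candidate": "a", "criteria_version": "v1", "replied": True},
    {"candidate": "b", "criteria_version": "v1", "replied": False},
    {"candidate": "c", "criteria_version": "v2", "replied": True},
]

stats = defaultdict(lambda: [0, 0])  # version -> [replies, total]
for c in cohorts:
    stats[c["criteria_version"]][0] += c["replied"]
    stats[c["criteria_version"]][1] += 1

for version, (replies, total) in sorted(stats.items()):
    print(version, f"{replies}/{total} replied")
```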

Use case recipes

Example patterns teams can run immediately.

Founding ML Engineer (stealth startup)

ML Infra
  • Search setup: seed with model infra repos (`vllm`, `transformers`, `triton`) and narrow to startup/early-team surfaces.
  • EEA criteria: top contributor rank, shipped production inference/training system, and paper/talk signal in relevant venues.
  • Expected output: high-signal shortlist of builders with maintainer velocity and low profile saturation.
  • Target repos: 10-20. Score bar: 85+. Webset cadence: weekly.

Forward Deployed Engineer

Deployment
  • Search setup: target distributed systems repos plus customer deployment indicators and implementation-depth constraints.
  • EEA criteria: production ownership proof, systems reliability changes, and evidence of customer-facing technical delivery.
  • Expected output: candidates with both backend depth and field execution signal, not pure platform-only profiles.
  • Primary stack: Go + Python. Score bar: 82+. Pipeline stage: contact fast.

Staff Frontend Platform Engineer

Frontend Platform
  • Search setup: focus on framework core repos, perf tooling ecosystems, and maintainership markers over title matching.
  • EEA criteria: core contribution to framework/tooling, performance ownership, and cross-team DX impact evidence.
  • Expected output: platform-minded ICs who improve system-level frontend velocity across teams.
  • Signal type: maintainer. Score bar: 80+. Key proof: tooling commits.

Security Engineer (product + OSS)

Security
  • Search setup: define criteria around CVE discovery, advisories, and security-centric repos with active remediation work.
  • EEA criteria: CVE or advisory contribution, sustained security commits, and public research artifact (talk or writeup).
  • Expected output: candidates with proof of practical offensive/defensive capability and product-grade security ownership.
  • Signal source: CVE + commits. Score bar: 83+. Review mode: strict verify.

Worked examples (realistic)

Concrete repo targets, criteria thresholds, and expected pipeline output to calibrate your first runs.

Example 1: Founding ML Infra Engineer, expected 35-60 candidates
  • Repo targets: `vllm`, `transformers`, `triton`, `deepspeed`, `llama.cpp`, `ray`.
  • Score threshold: Builder Score >= 85 and at least 2 EEA criteria met.
  • Bad criteria: "Strong ML engineer, startup mindset, good communicator."
  • Better criteria: top-10 contributor rank OR production inference ownership + public technical artifact.
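
The threshold above combines a score bar with a criteria count. A sketch of that admission rule, with candidate fields invented for illustration:

```python
def admit(candidate: dict) -> bool:
    """Example 1's rule: Builder Score >= 85 and at least 2 EEA criteria met."""
    criteria_met = sum([
        candidate.get("top10_contributor", False),
        candidate.get("prod_inference_ownership", False),
        candidate.get("public_technical_artifact", False),
    ])
    return candidate["builder_score"] >= 85 and criteria_met >= 2
```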

Example 2: Staff Backend (Distributed Systems), expected 45-80 candidates
  • Repo targets: `kubernetes`, `temporal`, `envoy`, `vitess`, `cockroachdb`, `grpc`.
  • Score threshold: Builder Score >= 82 with production ownership proof.
  • Bad criteria: "Great backend developer from top company."
  • Better criteria: 50+ commits + maintainer/reviewer signal + reliability/latency or scaling evidence.

Best for, not ideal for, and limits

Set expectations early so teams use SourceKit where it performs best.

Best for

  • Technical roles with strong public artifact signal.
  • Teams prioritizing objective proof over profile polish.
  • Durable searches that benefit from weekly Websets.

Not ideal for

  • Roles with little or no public technical footprint.
  • Hiring where title pedigree is the primary requirement.
  • One-off searches with no criteria refinement cycle.

What it does not do

  • Does not replace interview evaluation or references.
  • Does not guarantee intent-to-join or compensation fit.
  • Does not rely on self-reported profile claims alone.

Starter templates

Copy-ready prompts for new searches and Webset setup.

Role intake template

Search setup:

Role: Staff Backend Engineer (distributed systems)
Primary work: high-throughput APIs and workflow orchestration
Must-have evidence:
- 50+ meaningful commits to relevant repos
- ownership signal (maintainer/reviewer/RFC)
- production-scale system evidence
Target company surfaces: infra-heavy startups + OSS-adjacent teams

Webset criteria template

Persistent pipeline:

Build a weekly Webset for Founding ML Engineers.
Admit only if candidate matches at least 2/3:
1) top contributor to frontier ML repo
2) shipped production ML system
3) publication/talk evidence in relevant venues
Add enrichments: email, current company, GitHub stats.
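
Expressed as structured configuration, the same template might look like the sketch below. The keys are illustrative assumptions, not the Exa Websets API's actual request schema.

```python
# Illustrative Webset configuration; keys are assumptions.
webset_config = {
    "name": "Founding ML Engineers",
    "cadence": "weekly",
    "admission": {
        "min_criteria_met": 2,
        "criteria": [
            "top contributor to frontier ML repo",
            "shipped production ML system",
            "publication/talk evidence in relevant venues",
        ],
    },
    "enrichments": ["email", "current_company", "github_stats"],
}
```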

Run this now

Take one action to validate the workflow with your current role.

Start with one role, tighten criteria after first results, and convert the winning search into a weekly Webset.

Run your first search