Case study · Account sourcing

An account-sourcing engine that scores who is ready to sell

The best acquisition targets are founder-owned businesses that sell before they ever hit a banker. The system that finds them maps a fragmented market, scores every company on a two-factor Fit + Sell-Readiness model, then verifies the top names live in Clay before anyone picks up the phone. A worked example on a residential and light-commercial HVAC thesis across the US Sun Belt.

Apollo.ioGrataClay
Signal scoringFit + Sell-ReadinessData governance
43
companies mapped
11
Tier A shortlist
6
owner emails verified
3
re-checked live in Clay
The problem

Origination is a search problem, not a contact problem

The best acquisition targets are not for sale. They are founder-owned operators with no banker, no data room, and no listing, the kind a buyer wants to reach before a competitive process starts. The hard part is not sending an email. It is knowing, out of hundreds of look-alike companies, which handful is actually approachable right now, and being honest about how confident you are in that read. An origination team that treats every company the same wastes its most expensive resource: a principal’s time on the phone.

The engine

Six steps from thesis to a principal’s call list

Define the buy-box thesis Discover (Apollo / Grata + public web) Enrich (Clay) Score: Fit + Sell-Readiness Verify live in Clay Route to principals

The buy-box for this worked example: residential & light-commercial HVAC / mechanical, roughly $5–30M revenue, US Sun Belt and Southeast, founder-owned, no prior institutional capital. Discovery uses paid databases plus public sources; enrichment and scoring run through Clay. The verification step, highlighted, re-checks the top names against live data before anyone acts on them.

The scoring model

Two factors, one honest number

Every company gets a score out of 100, built from two halves and then discounted for how much I actually trust the data. The split keeps a great-fit company that shows no signal of selling separate from a smaller company whose owner looks genuinely ready.

Fit Score /50
Firmographic match to the buy-box: sub-vertical, revenue / size band, Sun-Belt geography, and recurring-revenue mix (maintenance plans).
Sell-Readiness /50
Inferred ownership signals: owner tenure (by founded year), founder / family ownership with no successor, and the absence of a recent professionalizing hire (GM / CFO / President).
Confidence adj.
Subtract for thin data: high −0, medium −7, low −14. A company cannot rank highly on a guess.
Tiers
A ≥ 80 (high / medium confidence only) · B 63–79 · C < 63.

Sell-Readiness is inferred, never claimed. A high score means the public signals line up the way they tend to before an owner sells, not that anyone intends to sell.

The shortlist

From 43 companies to a call list a principal can work

Origination is a filtering problem, not a contact problem. Each step below is a real cut: score every company on fit and readiness, keep the strong ones, confirm the owners are reachable, then spot-verify before anyone dials. A principal spends time only on the names that survive every gate.

43
Companies mapped
The full thesis-fit universe, scored from public signals.
↓  74% filtered out
11
Tier A shortlist
Strong firmographic fit and strong inferred sell-readiness.
↓  owners confirmed
6
Owner emails verified
Reachable decision-makers confirmed, not guessed.
↓  live verification gate
3
Re-checked live in Clay
Spot-verified the moment before a principal dials.

The two-factor Fit + Sell-Readiness model that produces this shortlist is in the scoring section above; the full ranked list of every scored company is below. The funnel is the whole story in one view: a principal works the 11, starting with the verified names, instead of dialing 43 cold.

The ranked list

The same map, as a call list

Sorted by total score. Company identities are removed; this is the structure a principal works down, not a directory. One company was disqualified outright (already PE-owned), so 42 are scored here.

IDSub-verticalSTFitSellAdjConfScoreTier
T01Multi-trade home servicesTN50420High92A
T02Multi-trade home servicesTX50380High88A
T03Multi-trade home servicesNC50380High88A
T04Multi-trade home servicesTN50380High88A
T05Multi-trade home servicesSC50380High88A
T06Residential / light-commercial HVACFL46350High81A
T07Multi-trade home servicesFL42390High81A
T08Residential / light-commercial HVACAZ46350High81A
T09Residential / light-commercial HVACFL42380High80A
T10Residential / light-commercial HVACGA42380High80A
T11Residential / light-commercial HVACGA42380High80A
T12Multi-trade home servicesAZ46330High79B
T13Multi-trade home servicesFL46310High77B
T14Multi-trade home servicesGA46310High77B
T15Residential / light-commercial HVACGA4638-7Med77B
T16Residential / light-commercial HVACNC46300High76B
T17Residential / light-commercial HVACNC46290High75B
T18Residential / light-commercial HVACFL4238-7Med73B
T19Residential / light-commercial HVACNC46270High73B
T20Residential / light-commercial HVACSC39340High73B
T21Residential / light-commercial HVACTX5029-7Med72B
T22Multi-trade home servicesAZ4633-7Med72B
T23Residential / light-commercial HVACTX33380High71B
T24Residential / light-commercial HVACTN4632-7Med71B
T25Residential / light-commercial HVACFL46240High70B
T26Residential / light-commercial HVACAZ50200High70B
T27Residential / light-commercial HVACTX42270High69B
T28Residential / light-commercial HVACGA29380High67B
T29Residential / light-commercial HVACTX4627-7Med66B
T30Residential / light-commercial HVACGA50160High66B
T31Residential / light-commercial HVACFL4229-7Med64B
T32Residential / light-commercial HVACSC3338-7Med64B
T33Residential / light-commercial HVACTX3436-7Med63B
T34Residential / light-commercial HVACFL4227-7Med62C
T35Residential / light-commercial HVACTN4224-7Med59C
T36Residential / light-commercial HVACSC3926-7Med58C
T37Multi-trade home servicesNC3925-7Med57C
T38Residential / light-commercial HVACTX3724-7Med54C
T39Residential / light-commercial HVACTX3425-7Med52C
T40Residential / light-commercial HVACTN4216-7Med51C
T41Residential / light-commercial HVACNC3719-7Med49C
T42Residential / light-commercial HVACAZ2722-7Med42C

ST = state. Adj = data-confidence adjustment. A medium-confidence target carries a −7 penalty, which is why some high-fit companies still land in B or C.

The verification pass

Before a principal dials, the score gets tested

Every row above is first-pass scored from public signals. That is enough to rank, not enough to act. So the top names run a live verification pass in Clay, owner and tenure re-checked, recent leadership hires pulled, ownership confirmed. I re-ran three of my own Tier A targets. Here is what moved.

Held · Tier A

Verification 1 — confirmed

A second-generation, family-owned multi-trade operator with a recurring maintenance club. Clay confirmed the ownership and tenure and found no recent professionalizing hire. The score held.

Refined · Tier B

Verification 2 — re-checked

A 36-year solo founder with no successor: the strongest sell-readiness signal of the three. But Clay showed a smaller headcount than my first-pass estimate, so the size half of Fit is now uncertain. It drops to B pending a size verification. Strong signal, honest doubt.

Pulled · Tier C

Verification 3 — demoted

Live Clay could not verify the owner, the tenure, or the family-succession story. Unverifiable ownership triggers the confidence penalty, so it falls to C. The score followed the evidence, not my assumption. The gate did its job.

“I re-ran three of my own Tier A targets live in Clay. One held, one shrank on size, and one I had to demote because the ownership data did not hold up. The score follows the evidence. An inferred signal is scored, never presented as fact.”

Why build it in-house

The same work, owned instead of rented

A black-box list you rent

A retainer, a black-box list, and someone else’s definition of a good target. The scoring logic stays hidden, the buy-box cannot be tuned as the thesis shifts, and the relationships live on the vendor’s side. When the retainer stops, the pipeline stops.

A tunable model you own

A transparent, tunable scoring model, a market map that refreshes on a schedule, and verified owner contacts that sit in the CRM, not a vendor’s. Built on Clay and automation instead of headcount, so it runs in-house at low ongoing cost and improves with every pass.

Origination as a system, not a list.

Define the buy-box once, and the market map, scoring, and live verification run on a schedule. The output is a shortlist a team can work on Monday, not a directory. See the rest of the stack, or get in touch.