---
name: ai-lead-sourcing
description: Automatically discover and import leads matching ICP criteria from B2B databases (ZoomInfo, Apollo, LinkedIn Sales Navigator, Lusha). Use when building prospect lists, sourcing leads from databases, defining ICP criteria, enriching contact data, deduplicating lead lists, or automating lead import to CRM. Triggers on phrases like "lead sourcing", "ICP criteria", "prospect list building", "ZoomInfo search", "Apollo leads", "LinkedIn prospecting", "database lead generation", "B2B data sourcing", "contact enrichment", "lead import".
---

# AI-Powered Lead Sourcing from Database

Automatically discover, enrich, deduplicate, and import leads matching Ideal Customer Profile (ICP) criteria from multiple B2B data sources.

## ICP Definition & Lead Sourcing Framework

```
IDEAL CUSTOMER PROFILE (ICP) DEFINITION
═══════════════════════════════════════════════════════════

Firmographic Criteria (Primary Filters):
  ┌──────────────────────────┬──────────────────────────────────────────┐
  │ Attribute                │ Target Range / Values                    │
  ├──────────────────────────┼──────────────────────────────────────────┤
  │ Employee count           │ 50–1,000 (adjust by product/price point) │
  │ Annual revenue           │ $10M–$500M                              │
  │ Company type             │ Private, VC-funded, Public               │
  │ Industry (NAICS)         │ 51–53 (Information), 54 (Professional),  │
  │                          │ 45–49 (Wholesale Trade)                  │
  │ Headquarters location    │ US, Canada, UK, Western Europe           │
  │ Years in business        │ 5+ years (established)                   │
  │ Funding status           │ Series A+ or profitable (bootstrapped)   │
  │ Total funding raised     │ $5M+ (growth-stage companies)            │
  └──────────────────────────┴──────────────────────────────────────────┘

Technographic Criteria (Secondary Filters):
  → Using complementary technology (indicates tech sophistication):
     - CRM: Salesforce, HubSpot Enterprise, Microsoft Dynamics
     - Marketing: Marketo, HubSpot Marketing, Pardot
     - Analytics: Google Analytics 4, Mixpanel, Amplitude
     - Cloud: AWS, Azure, Google Cloud Platform
     - Communication: Slack, Microsoft Teams, Zoom
  → Using competitor products (displacement opportunity):
     - [Competitor A]: Identify companies on their customer page
     - [Competitor B]: Detect via BuiltWith / Datanyze
     - [Competitor C]: Identify from G2 / Capterra reviews
  → NOT using category solution (greenfield opportunity):
     - Gap in their stack indicates unmet need

Behavioral Criteria (Intent Signals):
  → Website engagement: Visited pricing, demo, or product pages
  → Content consumption: Downloaded whitepapers, attended webinars
  → Job postings: Hiring roles related to solution (e.g., "Head of Growth")
  → Employee growth: Headcount increased >15% in last 6 months
  → Funding event: Raised capital in last 90 days
  → Technology changes: Added complementary tools in last 30 days

Exclusion Criteria (Hard Filters):
  → Competitor companies (maintain blocklist of 20–50 competitors)
  → Government / public sector (if not target market)
  → Non-profit organizations (if for-profit product)
  → Companies with <50 employees (below minimum viable customer)
  → Companies in sanctioned countries or restricted regions
  → Existing customers (prevent duplicate outreach)
  → Churned customers (unless winback campaign)
```

## Data Source Configuration & Search Strategy

```
DATA SOURCE COMPARISON & CONFIGURATION
═══════════════════════════════════════════════════════════

ZoomInfo (Enterprise Grade):
  Database size: 50M+ companies, 500M+ contacts
  Data freshness: Updated within 72 hours (claimed)
  Coverage: 95% of Fortune 500, 85% of US private companies
  Key features:
    → Direct-dial phone numbers: 40M+ (industry leader)
    → Personal email addresses: 500M+
    → Company technographics: 2M+ companies with stack data
    → Organizational charts: 100K+ companies
    → Intent data (ZoomInfo Intent): 30K+ accounts with buying signals
    → Job change alerts: Real-time employee movement detection
  Pricing: $15K–$50K/year (Enterprise plans, per-seat licensing)
  Integration: Salesforce, HubSpot, Outreach, SalesLoft, Zapier
  API access: Yes (REST API for custom integrations)
  Search capabilities: 40+ filters, boolean operators, saved searches
  Recommended for: Enterprise sales teams requiring highest data accuracy

Apollo.io (Mid-Market Leader):
  Database size: 26M+ companies, 200M+ contacts, 300B+ data points
  Data freshness: Updated within 7 days (claimed)
  Coverage: 80% of US companies, growing global coverage
  Key features:
    → Email finder and verifier: Built-in email verification
    → Phone numbers: 13M+ direct dials
    → Email sequences: Built-in outreach platform (750 emails/day free)
    → Intent data: 10K+ accounts with buying signals
    → Technology detection: 400K+ companies
    → Company events: Funding, hiring, technology changes
  Pricing: Free tier, $49/month (Basic), $99/month (Professional), $199/month (Business)
  Integration: Salesforce, HubSpot, Slack, Zapier
  API access: Yes (REST API, webhooks)
  Search capabilities: 30+ filters, boolean operators, list building
  Recommended for: Mid-market teams wanting cost-effective all-in-one platform

LinkedIn Sales Navigator (Relationship Intelligence):
  Database size: 1B+ professionals, 70M+ companies
  Data freshness: Real-time (user-updated profiles)
  Coverage: 95% of decision-makers on the platform
  Key features:
    → Advanced search: 15+ filters (title, company, industry, tenure)
    → Lead recommendations: AI-powered prospect suggestions
    → Account alerts: Changes at target companies (hires, departures)
    → InMail credits: Direct messaging to non-connections
    → TeamLink: Find mutual connections across your team
    → Lead Builder: Guided search interface
  Pricing: $99/month (Professional), $179/month (Team), $229/month (Enterprise)
  Integration: Salesforce, HubSpot, Outreach, SalesLoft
  API access: Limited (LinkedIn API has strict usage limits)
  Search capabilities: Advanced filters, saved searches, boolean operators
  Recommended for: Relationship-driven selling, account-based approaches

Lusha / Snov.io (Email Verification Specialists):
  Lusha:
    → Database: 300M+ contacts, 25M+ companies
    → Chrome extension: In-browser contact finding
    → Email verification: 95%+ accuracy claimed
    → Pricing: $39/month (Starter), $69/month (Growth), $129/month (Scale)
  Snov.io:
    → Database: 150M+ contacts, 35M+ companies
    → Email drip campaigns: Built-in outreach
    → Domain search: Find all contacts at a company
    → Pricing: $29/month (Basic), $59/month (Pro), $99/month (Business)

SEARCH STRATEGY BY SOURCE:
  Priority 1 (ZoomInfo + Apollo): Build core prospect list
    → Run ICP search with firmographic + technographic filters
    → Export: Company + contact records (max records per search)
    → Expected results: 5,000–50,000 companies (depends on ICP narrowness)

  Priority 2 (LinkedIn Sales Navigator): Enrich with relationship data
    → Load ZoomInfo/Apollo results as target accounts
    → Search for contacts at target accounts
    → Identify mutual connections and engagement history
    → Send connection requests with personalized notes

  Priority 3 (Lusha/Snov.io): Verify and supplement email data
    → Verify emails from ZoomInfo/Apollo (cross-check accuracy)
    → Find additional contacts not in primary databases
    → Check email deliverability before outreach
```

## Lead List Building & Deduplication Process

```
LEAD LIST BUILDING WORKFLOW
═══════════════════════════════════════════════════════════

Step 1 — Define Search Parameters (30 minutes):
  → Select data source: ZoomInfo (primary), Apollo (secondary)
  → Apply firmographic filters: Employee count, revenue, industry, location
  → Apply technographic filters: Complementary tools, competitor products
  → Apply exclusion filters: Competitors, existing customers, non-target
  → Save search for re-running (updates as new companies match)
  → Export results: CSV with company + contact fields

Step 2 — Initial Data Review (30 minutes):
  → Review result count: Expected 5,000–50,000 companies
  → Spot-check 10–20 records: Verify data accuracy (company exists, contact role correct)
  → Check data completeness: % records with email, phone, company details
  → Flag incomplete records: Missing critical fields (email, company name)
  → Remove obviously invalid records: Personal email domains, fake phone numbers

Step 3 — Deduplication (15 minutes, automated):
  → Deduplication rules:
     - Company dedup: Match by domain name (company.com)
     - Contact dedup: Match by email address
     - Fuzzy match: Allow 1–2 character differences (typos)
  → Cross-source dedup: Compare ZoomInfo results with Apollo results
     - Prefer ZoomInfo data (higher accuracy for phone numbers)
     - Use Apollo data for missing contacts or newer records
  → Existing CRM dedup: Compare against Salesforce/HubSpot database
     - Exclude existing contacts and accounts
     - Flag "update needed" for contacts with changed roles/companies
  → Expected reduction: 15–30% of initial records removed

Step 4 — Lead Scoring & Prioritization (15 minutes, automated):
  → Score each lead 0–100 based on:
     - ICP fit score (0–40 points): How closely company matches ICP
     - Data completeness (0–20 points): % of fields populated
     - Intent signals (0–20 points): Recent funding, hiring, tech changes
     - Relationship proximity (0–20 points): Mutual connections, prior engagement
  → Tier leads by score:
     - Tier 1 (80+): 10–20% of list — immediate outreach
     - Tier 2 (60–79): 30–40% of list — standard campaign
     - Tier 3 (40–59): 30–40% of list — nurture campaign
     - Tier 4 (<40): 5–10% of list — deprioritize or discard
  → Expected outcome: 500–2,000 Tier 1 leads ready for outreach

Step 5 — CRM Import & Campaign Setup (30 minutes):
  → Map fields to CRM schema:
     - Company: Name, website, industry, employee count, revenue
     - Contact: Name, title, email, phone, LinkedIn profile
     - Custom: Lead source, ICP fit score, data source, import date
  → Create campaign record in CRM: "[Campaign Name] — [Date]"
  → Import records via:
     - Salesforce Data Loader / HubSpot Import Tool
     - Automated API sync (ZoomInfo Direct Connect, Apollo API)
  → Assign leads to reps:
     - Round-robin assignment (equal distribution)
     - Territory-based assignment (by geography or industry)
     - Skill-based assignment (by industry expertise)
  → Trigger outreach sequence:
     - Multi-channel cadence begins within 2 hours of import
     - First touch: Email or LinkedIn connection request

Step 6 — Ongoing List Management:
  → Monthly refresh: Re-run saved searches for new matches
  → Quarterly ICP review: Adjust criteria based on win/loss data
  → Data decay monitoring: Flag records older than 12 months
  → Bounce rate tracking: Remove invalid emails (target: <5% bounce rate)
  → List hygiene: Monthly cleanup of duplicates and invalid records
```

## Lead Sourcing Performance Metrics

```
LEAD SOURCING DASHBOARD
═══════════════════════════════════════════════════════════

Weekly Metrics:
  → New leads sourced: [number] (target: 200–500/week)
  → Leads imported to CRM: [number] (target: >90% of sourced)
  → Data completeness rate: [%] (target: >85% with email + phone)
  → Deduplication rate: [%] (target: <20% duplicates)
  → ICP fit rate: [%] (target: >70% of leads match ICP)

Monthly Metrics:
  → Total leads sourced: [number] (target: 800–2,000/month)
  → Tier 1 leads: [number] (target: 15–25% of total)
  → Outreach initiated: [%] of Tier 1 leads within 48 hours (target: >90%)
  → Email deliverability: [%] (target: >95%, bounce rate <5%)
  → Meeting booked rate: [%] of contacted leads (target: >5%)

Revenue Attribution:
  → Pipeline generated: $[amount] from sourced leads
  → Closed revenue: $[amount] from sourced leads (last 12 months)
  → Average deal size: $[amount] (sourced vs. other sources)
  → Cost per lead: $[amount] (data source cost / leads sourced)
  → Cost per opportunity: $[amount] (data source cost / opportunities)
  → ROI: [%] (revenue / data source cost)

Data Quality Scorecard:
  → Email accuracy: [%] verified (target: >90%)
  → Phone accuracy: [%] valid direct dials (target: >75%)
  → Company data freshness: [%] updated within 30 days (target: >85%)
  → Contact role accuracy: [%] confirmed titles (target: >80%)
  → Data enrichment rate: [%] records with >80% fields (target: >70%)

Source Comparison:
  ┌──────────────────┬──────────┬──────────┬──────────┬────────────┐
  │ Source           │ Records  │ Accuracy │ Cost/Rec │ Conv Rate  │
  ├──────────────────┼──────────┼──────────┼──────────┼────────────┤
  │ ZoomInfo         │ 50,000   │ 92%      │ $2.50    │ 8.5%       │
  │ Apollo           │ 30,000   │ 85%      │ $0.50    │ 6.2%       │
  │ LinkedIn Nav     │ 10,000   │ 95%      │ $1.20    │ 12.3%      │
  │ Lusha            │ 15,000   │ 88%      │ $0.80    │ 5.8%       │
  │ Snov.io          │ 20,000   │ 83%      │ $0.40    │ 5.1%       │
  └──────────────────┴──────────┴──────────┴──────────┴────────────┘
```

## Integration Points

- **ZoomInfo Direct Connect**: Bi-directional sync with Salesforce — auto-enrich account and contact records, push firmographic and technographic data in real-time
- **Apollo API**: Automated lead list building and export — programmatically query database, filter by ICP, export to CSV or push directly to CRM
- **LinkedIn Sales Navigator**: Saved searches and alerts — set up automated lead recommendations, account change notifications, TeamLink mutual connection discovery
- **Salesforce / HubSpot**: CRM integration — lead import, deduplication rules, assignment rules, campaign tracking, data enrichment
- **Lusha / Snov.io Chrome Extension**: Browser-based contact finding — click-to-find contacts on any webpage, verify emails before outreach
- **NeverBounce / ZeroBounce**: Email verification — batch verify email lists before import, reduce bounce rates to <3%
- **Slack**: Lead alert notifications — new high-scoring leads posted to dedicated channel with company details and recommended outreach
- **ZoomInfo Intent / Bombora**: Intent data overlay — flag accounts actively researching solution categories, prioritize for immediate outreach

## Edge Cases

- **ICP too broad** (>50,000 matching companies): List is too large for effective outreach
  → Narrow by technographic criteria (add complementary tool requirement)
  → Narrow by revenue band (focus on specific revenue range)
  → Add geographic filter (focus on top 5–10 metro areas)
  → Add firmographic filter (employee count, funding status, industry sub-segment)
  → Target: 5,000–15,000 companies for manageable outbound campaigns

- **ICP too narrow** (<500 matching companies): Not enough leads for sustainable pipeline
  → Expand industry list (add adjacent NAICS codes)
  → Widen employee count range (include smaller or larger companies)
  → Remove secondary technographic filters (keep primary only)
  → Add international markets (EMEA, APAC) if product supports
  → Consider "adjacent ICP": companies that are 80% match but have unique characteristics

- **Data quality degradation**: Contact data becomes stale (job changes, email bounces)
  → Monthly data refresh: Re-enrich existing contacts via API
  → Bounce rate monitoring: Remove hard-bounced emails immediately
  → Job change detection: Flag contacts who changed roles (via ZoomInfo alerts)
  → Target maintenance: Keep data freshness >90% through monthly enrichment cycles
  → Budget: 10–15% of data source spend on monthly enrichment

- **Competitor company overlap**: Competitors appear in ICP search results
  → Maintain competitor blocklist (20–50 company names/domains)
  → Automated filtering: Apply blocklist before CRM import
  → Review quarterly: Add new competitors to blocklist
  → Exception: Some "competitors" may be in different segments — review manually

- **Contact data privacy compliance**: GDPR, CCPA, and CAN-SPAM compliance
  → GDPR (EU contacts): Legitimate interest basis must be documented, opt-out mechanism required
  → CCPA (California): Right to opt-out of data sale/sharing, deletion request handling
  → CAN-SPAM: Valid business address in emails, opt-out link, honor opt-outs within 10 business days
  → Best practice: Source data from legitimate B2B databases (they handle compliance)
  → Never purchase lists from unverified sources (data quality and legal risk)
  → Maintain data processing records for compliance audits

- **Small company universe** (niche market): Only 200–500 companies match ICP
  → Source ALL matching companies (comprehensive coverage strategy)
  → Supplement with manual research: LinkedIn, industry directories, event attendee lists
  → Focus on quality over quantity: Deep personalization for each prospect
  → Multi-touch strategy: 8–12 touches per prospect over 60 days
  → Account-based approach: Treat each company as a named account with dedicated plan

- **Existing customer in lead list**: Sourced leads include current customers
  → Automated exclusion: Cross-reference against CRM customer database before import
  → Exception handling: Flag "customer" leads for upsell/cross-sell campaign instead of new-business campaign
  → Customer data enrichment: Use sourced data to update existing CRM records
  → Revenue opportunity: Calculate expansion potential for each existing customer in list
