Identifiers
An identifier is a piece of data that may uniquely or partially identify a real-world entity. The engine accepts the following types:| Type | Description | Format requirement |
|---|---|---|
email | Email address | RFC 5321 |
phone | Phone number | E.164 (e.g. +15551234567) |
username | Platform-agnostic username | Any |
ip | IPv4 or IPv6 address | Standard notation |
domain | Registered domain name | RFC 1035 |
name | Full or partial name | Any |
url | Web URL for scraping or analysis | URL |
address | Physical or mailing address | Any |
company_name | Business or organization name | Any |
social_url | Direct URL to a social media profile | URL |
Entity resolution
When multiple identifiers are submitted, the engine attempts to determine whether they all belong to the same real-world person or organization. The result is communicated viaresolution.status:
| Status | Meaning |
|---|---|
resolved | All identifiers map to a single entity with high confidence. |
ambiguous | Identifiers match more than one candidate entity. All candidates are returned ranked by confidence. The caller should review and select the correct match. |
no_match | No entity could be associated with any of the provided identifiers. |
status is ambiguous, the response contains multiple entries in the entities array, each with its own confidence score. Do not assume the first entry is correct; review each candidate.
Confidence scores
Confidence is expressed as a float between0.0 and 1.0. It appears at three levels:
resolution.confidence: how certain the engine is that the returned entities represent the queried identifiersentity.confidence: how certain the engine is that a specific candidate entity matches (only meaningful whenstatusisambiguous)- Per data-point confidence: every phone, address, social profile, and associate entry carries its own confidence score reflecting source quality and corroboration
| Range | Interpretation |
|---|---|
| 0.90—1.0 | High confidence. Multiple independent sources agree. |
| 0.70—0.89 | Moderate confidence. Some corroboration but gaps remain. |
| 0.50—0.69 | Low confidence. Single source or conflicting signals. |
| < 0.50 | Speculative. Treat with significant caution. |
Source coverage and partial results
The engine queries all configured connectors in parallel. If a connector is unreachable (e.g. due to a firewall egress rule), the query still completes with results from the connectors that were available. Thesource_coverage object in every response discloses which connectors were queried and which were not:
available list means that source was not checked for this query.
Use GET /connectors to inspect connector status before running critical queries.
Lookup depth
Thedepth option controls how broadly the engine queries sources:
| Depth | Behavior |
|---|---|
standard | Queries primary connectors. Covers the majority of use cases with the lowest latency. |
deep | Queries all available connectors including secondary and derived sources. Higher latency and connector usage. Use when standard coverage is insufficient. |