Version:
classimplements VisionFinderInterface
VercelAiSdkVisionFinder
Vision finder implementation powered by Vercel AI SDK.
The provider uses an LLM vision model to decide whether screen content matches
a natural language description and optionally returns one or more matching
regions.
typescript
class VercelAiSdkVisionFinder implements VisionFinderInterfaceMethods
findMatch
findMatch(matchRequest: MatchRequest<{ by: { description: string }; id: string; type: "vision" }, PROVIDER_DATA_TYPE>): Promise<MatchResult<Region>>Finds the single best region matching the given vision query.
Parameters
matchRequestMatchRequest<{ by: { description: string }; id: string; type: "vision" }, PROVIDER_DATA_TYPE>The match request containing the needle descriptionand haystack image.
Returns
Promise<MatchResult<Region>>findMatches
findMatches(matchRequest: MatchRequest<{ by: { description: string }; id: string; type: "vision" }, PROVIDER_DATA_TYPE>): Promise<MatchResult<Region>[]>Finds all regions matching the given vision query, up to the configured
maximum.
Parameters
matchRequestMatchRequest<{ by: { description: string }; id: string; type: "vision" }, PROVIDER_DATA_TYPE>The match request containing the needle descriptionand haystack image.
Returns
Promise<MatchResult<Region>[]>Was this page helpful?