Subscribe
Sign in
OmniParser V2

OmniParser V2

Turn any LLM into a Computer Use Agent
1 review
338 followers

What is OmniParser V2?

OmniParser ‘tokenizes’ UI screenshots from pixel spaces into structured elements in the screenshot that are interpretable by LLMs. This enables the LLMs to do retrieval based next action prediction given a set of parsed interactable elements.

Do you use OmniParser V2?

OmniParser V2 gallery image
OmniParser V2 gallery image
OmniParser V2 gallery image

Recent OmniParser V2 Launches

OmniParser V2

Forum Threads

View all

Review OmniParser V2?

5/5 based on 1 review

Reviews

Victor Dibia
1 review
Enables base models in UI understanding.