BACON-AI Visual QA Tools

Comparison dashboard: Mesh Hub UI analysis results
Test Summary
Two UI states captured: State 1 (default view, no room selected) vs State 2 (Brainstorm room open + attachment menu visible). Three analysis tools ran on the results.
Pixels Changed (Diff): 21.3%
YOLO Elements: 18
OmniParser Icons: 44
YOLO Inference: 264ms
OmniParser Inference: 169ms
State 1: Default (no room), 1400x900
State 2: Brainstorm + Attach Menu, 1400x900
Tool Comparison
Property | Visual Diff | YOLO UI Detection | OmniParser v2.0
Purpose | Pixel-level regression | UI element bounding boxes | Full screen understanding
Speed | Instant (ImageMagick) | 264ms (CPU) | 169ms (CPU)
Model Size | None (no ML) | 49MB | 1.1GB
Elements Found | 267,882 changed pixels | 18 (15 buttons, 3 images) | 44 icons/elements
Output | Red overlay diff image | Annotated screenshot | Annotated screenshot
Best For | Before/after comparison | "Are buttons present?" | Full UI inventory
License | N/A | AGPL-3.0 | MIT
Visual Diff: Pixel Regression Analysis
Red areas show every pixel that changed between State 1 and State 2. 267,882 out of 1,260,000 pixels changed (21.3%). Verdict: SIGNIFICANT.
Pixels Changed: 267,882
Similarity: 78.7%
Fuzz Threshold: 5%
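The percentages above follow directly from the pixel counts. The sketch below is a minimal illustration of that arithmetic, not the contents of visual-diff.sh. The function names are hypothetical. The fuzz comparison is a simplified per-channel tolerance and may not match the colour-distance metric ImageMagick actually uses.

```python
from typing import Sequence, Tuple

Pixel = Tuple[int, int, int]  # 8-bit RGB


def changed_pixels(before: Sequence[Pixel], after: Sequence[Pixel],
                   fuzz: float = 0.05) -> int:
    """Count pixels whose largest per-channel difference exceeds fuzz * 255.

    Assumption: this per-channel rule is a simplification chosen for the
    example; it is not ImageMagick's exact fuzz definition.
    """
    if len(before) != len(after):
        raise ValueError("images must have the same number of pixels")
    tolerance = fuzz * 255
    return sum(
        1
        for a, b in zip(before, after)
        if max(abs(x - y) for x, y in zip(a, b)) > tolerance
    )


def diff_percent(changed: int, width: int, height: int) -> float:
    """Share of the image that changed, as a percentage."""
    return 100.0 * changed / (width * height)


# Figures reported in this dashboard: 267,882 changed pixels at 1400x900.
percent = diff_percent(267_882, 1400, 900)
print(f"changed: {percent:.1f}%  similarity: {100 - percent:.1f}%")
```

With the reported figures this prints a 21.3% change and 78.7% similarity, matching the values shown above.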
Diff Output: red = changed pixels (visual-diff.sh)
What Changed
Chat panel (right): went from empty placeholder to message history
Sidebar: Brainstorm room highlighted
Attach menu: popup appeared bottom-left with 3 options
Input bar: text changed from "Select a room" to "Message #Brainstorm"
Canvas: nodes drifted (physics simulation, expected)
YOLO UI Element Detection
macpaw-research/yolov11l-ui-elements-detection (49MB), trained on the Screen2AX-Element dataset. Detects buttons, inputs, images, and interactive elements with bounding boxes.
Elements Detected: 18
Inference Time: 264ms
Confidence Threshold: 0.20
YOLO Annotated: bounding boxes on detected elements (ultralytics YOLOv11)
Detected Elements
AXButton 82%: JOIN MESH
AXButton 73%: Save Layout
AXButton 55%: Share File Path
AXButton 51%: Share URL
AXButton 36%: Send
AXButton 33%: Upload File
AXButton 33%: OFFLINE toggle
AXButton 31%: Gear icon
AXButton 29%: BACON-AI Docs
AXButton 25%: Rooms
+ 5 more buttons
AXImage 39%: Win11 PC node
AXImage 32%: Phone Sonnet
AXImage 28%: Canvas element
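Several detections sit just above the 0.20 confidence threshold, so the element count depends heavily on where that threshold is set. The sketch below illustrates this using only the 13 detections itemised above. The 5 unlisted buttons are left out because their scores are not given. The `Detection` type and helper names are hypothetical and are not part of the ultralytics API.

```python
from collections import Counter
from typing import List, NamedTuple


class Detection(NamedTuple):
    label: str
    confidence: float
    name: str


# The 13 detections itemised in this dashboard (5 further buttons are not
# listed there, so they are not included here).
LISTED: List[Detection] = [
    Detection("AXButton", 0.82, "JOIN MESH"),
    Detection("AXButton", 0.73, "Save Layout"),
    Detection("AXButton", 0.55, "Share File Path"),
    Detection("AXButton", 0.51, "Share URL"),
    Detection("AXButton", 0.36, "Send"),
    Detection("AXButton", 0.33, "Upload File"),
    Detection("AXButton", 0.33, "OFFLINE toggle"),
    Detection("AXButton", 0.31, "Gear icon"),
    Detection("AXButton", 0.29, "BACON-AI Docs"),
    Detection("AXButton", 0.25, "Rooms"),
    Detection("AXImage", 0.39, "Win11 PC node"),
    Detection("AXImage", 0.32, "Phone Sonnet"),
    Detection("AXImage", 0.28, "Canvas element"),
]


def keep(detections: List[Detection], threshold: float) -> List[Detection]:
    """Keep detections at or above the confidence threshold."""
    return [d for d in detections if d.confidence >= threshold]


def count_by_label(detections: List[Detection]) -> Counter:
    """Tally detections per element class."""
    return Counter(d.label for d in detections)


for threshold in (0.20, 0.30, 0.50):
    kept = keep(LISTED, threshold)
    print(f"threshold {threshold:.2f}: {len(kept)} kept, {dict(count_by_label(kept))}")
```

Raising the threshold from 0.20 to 0.30 drops three of the listed elements, and at 0.50 only four buttons remain.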
OmniParser v2.0: Full Screen Understanding
microsoft/OmniParser-v2.0 (1.1GB, MIT) is Microsoft's purpose-built screen parser for AI agents. Detects all interactive icons, buttons, text fields, and navigational elements.
Icons Detected: 44
Inference Time: 169ms
Avg Confidence: 68%
OmniParser Annotated: all interactive elements mapped (microsoft/OmniParser-v2.0)
Coverage Comparison
OmniParser found about 2.4x as many elements as YOLO (44 vs 18). It detected individual mesh nodes, sidebar items, and chat message components that YOLO missed. OmniParser is designed for screen-agent navigation, which makes it the more thorough of the two detectors for UI auditing.
YOLO (18 elements)
15 AXButtons
3 AXImages
OmniParser (44 elements)
44 icons (avg 68% conf)
Sidebar items
Mesh nodes
Chat elements
Input components
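One common way to check which elements one detector found and the other missed is to match bounding boxes by intersection over union (IoU). The sketch below illustrates that idea only. It is not how this dashboard produced its comparison, the box coordinates are invented for the example, and the 0.5 IoU cut-off is an assumed default.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2) in pixels


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def unmatched(candidates: List[Box], reference: List[Box],
              min_iou: float = 0.5) -> List[Box]:
    """Candidate boxes that overlap no reference box by at least min_iou."""
    return [c for c in candidates
            if all(iou(c, r) < min_iou for r in reference)]


# Hypothetical boxes for illustration only; not taken from the dashboard.
yolo_boxes: List[Box] = [(10, 10, 110, 50)]
omni_boxes: List[Box] = [(12, 11, 108, 49), (300, 200, 340, 240)]

# Elements the second detector reports that the first has no match for.
print(unmatched(omni_boxes, yolo_boxes))
```

Running the same function with the arguments swapped would list boxes that only YOLO reported, which helps confirm that the larger count reflects extra coverage and not just finer splitting of the same elements.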
Before / After Slider
Drag the slider to compare State 1 (before) and State 2 (after). Left = before, right = after.
YOLO / OmniParser Slider
Compare detection granularity: YOLO (left, 18 elements) vs OmniParser (right, 44 elements).
GIF Demos: Animated Interaction Recordings
Animated GIF recordings captured during the QA tools demo session. These show real-time browser interactions including mesh hub navigation and room click handling.
Recordings: 2
Mesh Animation: ~3s
Room Interaction: Click
Mesh Hub Animation
3-second animation of the BACON-AI mesh hub network topology.
gif-mesh-animation-3s.gif (1.5 MB, 3s loop)
Room Click Interaction
Click interaction recording showing room entry and navigation behavior.
gif-room-click-interaction.gif (1.8 MB, interaction)