Topic: computer use
-
Google's New AI Browses the Web Like a Human
Google has launched Gemini 2.5 Computer Use, an AI model that mimics human web browsing to automate interactions with websites lacking API access, such as completing online forms. This technology excels in user interface testing and digital navigation, building on prior agent-driven projects like...
Read More » -
Claude Sonnet 4.5 Launches to Power Next-Gen AI Agents
Anthropic has launched Claude Sonnet 4.5, an AI model capable of 30 hours of autonomous operation, demonstrated by independently coding a functional chat app with 11,000 lines. The model is positioned as the world's leading AI for real-world agents and coding, excelling in sectors like cybersecur...
Read More » -
Scale Document Analysis with Vision Language Models
Vision Language Models (VLMs) merge visual and textual interpretation, enabling advanced document analysis by understanding the interplay between text placement and imagery. VLMs excel in tasks requiring visual context, such as identifying checked documents or interpreting screen contents, where ...
Read More »