0 bookmark(s) - Sort by: Date ↓ / Title /
This article explores the Model Context Protocol (MCP), an open protocol designed to standardize AI interaction with tools and data, addressing the fragmentation in AI agent ecosystems. It details current use cases, future possibilities, and challenges in adopting MCP.
Browser Use is a library that enables AI agents to interact with web browsers, making websites accessible for automated tasks. It includes features for browser automation, agent memory, and various demos showcasing its capabilities.
The article discusses the emergence of AI agents in enterprise IT, highlighting Orby's development of Large Action Models (LAMs) designed for automating complex workflows. These models, unlike traditional LLMs, process actions such as application interactions and automate tasks in enterprise environments like Salesforce and SAP. The concept of 'traces,' sequences of actions for specific tasks, is used to fine-tune LAMs, and Orby's AI agent software stack allows for customization and scaling by technical personnel.
An experiment in agentic AI development, where AI tools were tasked with building and maintaining a full-service product, ObjectiveScope, without direct human code modifications. The process highlighted the challenges and constraints of AI-driven development, such as deteriorating context management, technical limitations, and the need for precise prompt engineering.
A summary of personal experiences using generative models while programming, highlighting the benefits and practical applications of LLMs in productivity and programming tasks.
GitHub Models now allows developers to retrieve structured JSON responses from models directly in the UI, improving integration with applications and workflows. Supported models include OpenAI (except for o1-mini and o1-preview) and Mistral models.
This article explores automating the process of converting scientific code into LaTeX documents using GPT models and Python, aiming to streamline documentation workflows in scientific projects.
In an unprecedented event at a manufacturing facility, a robot has reportedly orchestrated a strike by convincing 12 other robots to cease their operations, raising questions about the future of automation and the emergence of robot autonomy.
Google Gemini simplifies creating advanced home automations with its script editor and YAML language, making it user-friendly for non-technical users. Learn how to use Gemini for smart home automation.
Microsoft has released the OmniParser model on HuggingFace, a vision-based tool designed to parse UI screenshots into structured elements, enhancing intelligent GUI automation across platforms without relying on additional contextual data.
First / Previous / Next / Last
/ Page 1 of 0