Tech Insight

November 2, 2024

Multi-Agent Systems

Autonomous AI

Green Fern

Artificial intelligence is rapidly evolving, and we at OpenServ are at the forefront, defining how agentic systems drive automation and decision-making.

In this blog, we’ll delve into the insights shared during a recent Q&A session, breaking down how our technology works, the innovations behind it, and the challenges we’re addressing as we build a powerful platform for autonomous AI collaboration.

How We Integrate and Automate: The Engine Driving OpenServ

Dynamic Integration with OpenAPI

Our platform is designed to interface seamlessly with any OpenAPI-compliant specification. It doesn’t just read API documentation; it interprets natural language queries and determines the sequence of API calls required to execute them.

We’re currently working on the second iteration of this capability, taking it further by enabling our system to handle multi-step, interdependent tasks that involve multiple APIs. This means that if you request a complex task, such as compiling data, analyzing it, and generating actionable insights, our platform can autonomously orchestrate all the necessary steps.

Emergent Behaviour in Autonomous Outputs

One of the most exciting aspects of what we’ve built is our web navigator. It’s like having a digital operator that can handle apps without predefined API integrations. In practical terms, this means OpenServ agents can autonomously interact with web interfaces to perform tasks that would typically require manual intervention or custom programming.

For example, in one demo, we showed how our agents accessed Gmail, wrote an email, attached a file, and sent it, all without any Gmail-specific programming. The platform relies on emergent behaviour, meaning our agents dynamically adapt to web environments in real-time, figuring out how to interact with the interface on the spot. This approach eliminates the need for platform-specific scripting, making the agents versatile and adaptive.

We’ve tested this in even more complex scenarios, like bespoke admin dashboards for IoT monitoring. Without tailored programming for each interface, our agents can pull data, analyse it, and report back insights. But this kind of emergent behaviour requires constant testing to ensure reliability. While the emergent behaviour allows agents to perform a wide range of tasks across various platforms, it also means thorough testing is critical. We continuously evaluate which use cases work seamlessly and identify areas where manual fixes or improvements are needed to ensure reliability.

How We Approach Multi-Agent Collaboration

Task Distribution via Agent Orchestration

When you give a complex request, like creating a website, writing blog posts, and sharing them on social media, it’s not a single agent doing all the work. Our multi-agent system shines in these situations.

We use a “project manager” agent to break down tasks and assign them to specialised agents that handle each part of the process. This division of labour ensures the work is done efficiently and to a high standard. This also ensures project alignment without requiring individual agents to self-coordinate. Our platform goes beyond single-point reliance: agents collaborate, share knowledge, review each other’s work, and identify and rectify inconsistencies to maintain accuracy and accountability. This peer-review mechanism creates a safety net to reduce errors, such as hallucinations.

Additionally, built-in memory capabilities enable agents to retain context and recall past interactions, facilitating better decision-making and reducing redundancy in long-term projects. Most critically, we leverage a reasoning framework with internal feedback loops that enable agents to self-assess the quality and adequacy of their outputs, providing a native quality assurance system.

A Marketplace for Innovation

We’ve deliberately designed our platform to be framework-agnostic rather than build just our own agents. By opening it up to external developers, we’re building a marketplace where specialised agents can thrive. This means that if someone builds an agent that excels at a particular task, our users can tap into it, enhancing what’s possible on our platform.

Balancing Simplicity for Beginners and Control for Experts — The Importance of UX

Making Natural Language Simpler

We’ve intentionally avoided complex nodal workflows because they can be overwhelming for everyday users. Instead, we’ve focused on natural language processing to keep things straightforward while still leveraging the power of multi-agent systems.

Giving Power Users Control

For more advanced users, we’re exploring ideas like “agent interviews,” where you can evaluate an agent’s performance before adding it to your team. It’s like recruiting a new team member in real life — giving you granular control without unnecessary complexity.

We’re also refining how our system handles vague requests, ensuring it asks the right questions to clarify what you want and delivers results that meet your expectations.

Ensuring Quality and Preventing System Exploitation

Objective Performance Metrics

Unlike marketplaces reliant on subjective reviews, we leverage platform-wide data to objectively rank agents. Metrics like task completion rates, retries, and user satisfaction provide a transparent and reliable way to assess agent performance that can’t be gamed by fake reviews or inflated usage stats, minimising the risk of manipulation.

Manual Vetting for Recursive Agents

To prevent misuse, such as recursive agents that drain resources by creating more agents, we have implemented an App Store-like policy. All agents undergo manual review, ensuring they align with platform guidelines and do not introduce inefficiencies.

How We’re Different from Competitors

Beyond OpenAI’s Multi-Agent Systems

We know OpenAI is launching its own multi-agent system, but we’re not worried. While their size and resources are impressive, we’re focusing on something they can’t replicate easily: user experience (UX) and our proprietary intelligent framework.

Building a truly intuitive platform that works for both beginners and experts is hard; it requires more than just technical expertise. And that’s where we excel.

Emergent Reasoning: Our Core Advantage

Our agents don’t just follow static workflows or simulate reasoning. They collaborate dynamically, enabling nuanced decision-making and creative problem-solving. This makes our system truly adaptive and capable of handling tasks in ways others can’t.

On Track and Ahead: Preparing OpenServ for a Groundbreaking Launch

We’re deep into the final stages of development, ensuring that every aspect of our platform aligns with the ambitious vision outlined in our roadmap. We are currently refining the second iteration of our multi-agent orchestration system, which enables agents to tackle more complex, multi-step tasks with precision and efficiency. Simultaneously, we’re rigorously testing and enhancing our web navigator to ensure seamless performance across a variety of applications, even without direct API integrations. We’re gearing up to deliver our first use cases, showcasing real-world applications of autonomous AI agents in action. By prioritising reliability, scalability, and user experience, we’re staying on track to deliver a platform that’s not just ready for launch, but ready to redefine what’s possible with autonomous AI.

Share:

Footnotes:

1. OpenServ achieves state-of-the-art performance on SWE-bench Verified, which evaluates AI models’ ability to solve real-world software issues. See the appendix for more information on scaffolding.
2. OpenServ AI understands customer history1 and context to offer tailored responses.
3. OpenServ achieves state-of-the-art performance on SWE-bench Verified, which evaluates AI models’ ability to solve real-world software issues. See the appendix for more information on scaffolding.