菜单

关于 🐙 GitHub
arXiv 提交日期: 2026-04-14
📄 Abstract - Multi-Agent Object Detection Framework Based on Raspberry Pi YOLO Detector and Slack-Ollama Natural Language Interface

The paper presents design and prototype implementation of an edge based object detection system within the new paradigm of AI agents orchestration. It goes beyond traditional design approaches by leveraging on LLM based natural language interface for system control and communication and practically demonstrates integration of all system components into a single resource constrained hardware platform. The method is based on the proposed multi-agent object detection framework which tightly integrates different AI agents within the same task of providing object detection and tracking capabilities. The proposed design principles highlight the fast prototyping approach that is characteristic for transformational potential of generative AI systems, which are applied during both development and implementation stages. Instead of specialized communication and control interface, the system is made by using Slack channel chatbot agent and accompanying Ollama LLM reporting agent, which are both run locally on the same Raspberry Pi platform, alongside the dedicated YOLO based computer vision agent performing real time object detection and tracking. Agent orchestration is implemented through a specially designed event based message exchange subsystem, which represents an alternative to completely autonomous agent orchestration and control characteristic for contemporary LLM based frameworks like the recently proposed OpenClaw. Conducted experimental investigation provides valuable insights into limitations of the low cost testbed platforms in the design of completely centralized multi-agent AI systems. The paper also discusses comparative differences between presented approach and the solution that would require additional cloud based external resources.

顶级标签: multi-modal agents systems
详细标签: object detection edge computing multi-agent system llm interface raspberry pi 或 搜索:

基于树莓派YOLO检测器与Slack-Ollama自然语言接口的多智能体目标检测框架 / Multi-Agent Object Detection Framework Based on Raspberry Pi YOLO Detector and Slack-Ollama Natural Language Interface


1️⃣ 一句话总结

这篇论文提出并实现了一种在资源受限的树莓派硬件上,通过自然语言聊天界面(Slack-Ollama)来协调控制多个AI智能体(包括YOLO视觉检测器)协同工作的新型边缘目标检测系统原型。

源自 arXiv: 2604.13345