HASHIRU: Hierarchisches Agentensystem für hybride intelligente Ressourcennutzung

papers.abstract

Die rasanten Fortschritte bei Large Language Models (LLMs) treiben die Entwicklung autonomer Multi-Agenten-Systeme (MAS) voran. Allerdings mangelt es aktuellen Frameworks oft an Flexibilität, Ressourcenbewusstsein, Modellvielfalt und autonomer Werkzeugerstellung. Dieses Paper stellt HASHIRU (Hierarchical Agent System for Hybrid Intelligent Resource Utilization) vor, ein neuartiges MAS-Framework, das Flexibilität, Ressourceneffizienz und Anpassungsfähigkeit verbessert. HASHIRU verfügt über einen „CEO“-Agenten, der spezialisierte „Mitarbeiter“-Agenten dynamisch verwaltet, die basierend auf Aufgabenanforderungen und Ressourcenbeschränkungen (Kosten, Speicher) instanziiert werden. Seine hybride Intelligenz priorisiert kleinere, lokale LLMs (über Ollama), nutzt jedoch bei Bedarf flexibel externe APIs und größere Modelle. Ein ökonomisches Modell mit Einstellungs- und Entlassungskosten fördert Teamstabilität und effiziente Ressourcenallokation. Das System umfasst auch die autonome Erstellung von API-Werkzeugen und eine Speicherfunktion. Bewertungen bei Aufgaben wie der Begutachtung akademischer Paper (58 % Erfolgsrate), Sicherheitsbewertungen (100 % auf einer JailbreakBench-Teilmenge) und komplexem logischem Denken (Überlegenheit gegenüber Gemini 2.0 Flash bei GSM8K: 96 % vs. 61 %; JEEBench: 80 % vs. 68,3 %; SVAMP: 92 % vs. 84 %) demonstrieren die Fähigkeiten von HASHIRU. Fallstudien zeigen seine Selbstverbesserung durch autonome Kostenmodellgenerierung, Werkzeugintegration und Budgetverwaltung. HASHIRU bietet einen vielversprechenden Ansatz für robustere, effizientere und anpassungsfähigere MAS durch dynamische hierarchische Kontrolle, ressourcenbewusste hybride Intelligenz und autonome funktionale Erweiterung. Der Quellcode und Benchmarks sind unter https://github.com/HASHIRU-AI/HASHIRU bzw. https://github.com/HASHIRU-AI/HASHIRUBench verfügbar, und eine Live-Demo kann auf Anfrage unter https://hashiruagentx-hashiruai.hf.space genutzt werden.

English

Rapid Large Language Model (LLM) advancements are fueling autonomous Multi-Agent System (MAS) development. However, current frameworks often lack flexibility, resource awareness, model diversity, and autonomous tool creation. This paper introduces HASHIRU (Hierarchical Agent System for Hybrid Intelligent Resource Utilization), a novel MAS framework enhancing flexibility, resource efficiency, and adaptability. HASHIRU features a "CEO" agent dynamically managing specialized "employee" agents, instantiated based on task needs and resource constraints (cost, memory). Its hybrid intelligence prioritizes smaller, local LLMs (via Ollama) while flexibly using external APIs and larger models when necessary. An economic model with hiring/firing costs promotes team stability and efficient resource allocation. The system also includes autonomous API tool creation and a memory function. Evaluations on tasks like academic paper review (58% success), safety assessments (100% on a JailbreakBench subset), and complex reasoning (outperforming Gemini 2.0 Flash on GSM8K: 96% vs. 61%; JEEBench: 80% vs. 68.3%; SVAMP: 92% vs. 84%) demonstrate HASHIRU's capabilities. Case studies illustrate its self-improvement via autonomous cost model generation, tool integration, and budget management. HASHIRU offers a promising approach for more robust, efficient, and adaptable MAS through dynamic hierarchical control, resource-aware hybrid intelligence, and autonomous functional extension. Source code and benchmarks are available at https://github.com/HASHIRU-AI/HASHIRU and https://github.com/HASHIRU-AI/HASHIRUBench respectively, and a live demo is available at https://hashiruagentx-hashiruai.hf.space upon request.

HASHIRU: Hierarchisches Agentensystem für hybride intelligente Ressourcennutzung

HASHIRU: Hierarchical Agent System for Hybrid Intelligent Resource Utilization

papers.abstract

Support