透過程式綜合實現自然語言命令
Natural Language Commanding via Program Synthesis
June 6, 2023
作者: Apurva Gandhi, Thong Q. Nguyen, Huitian Jiao, Robert Steen, Ameya Bhatawdekar
cs.AI
摘要
我們提出了語義解釋器(Semantic Interpreter),這是一種針對生產力軟體(如Microsoft Office)的自然語言友好型人工智慧系統,利用大型語言模型(LLMs)來執行用戶意圖跨應用程式功能。雖然LLMs在理解以自然語言表達的用戶意圖方面表現出色,但對於實現需要超出文本轉換的應用程式特定用戶意圖來說並不足夠。因此,我們引入了Office領域特定語言(ODSL),這是一種簡潔、高層次的語言,專門用於在Office應用程式中執行操作並與實體進行交互。語義解釋器利用分析檢索提示構建方法與LLMs進行程式合成,將自然語言用戶發話翻譯為可以轉譯為應用程式API並隨後執行的ODSL程式。我們主要討論了針對Microsoft PowerPoint的研究探索。
English
We present Semantic Interpreter, a natural language-friendly AI system for
productivity software such as Microsoft Office that leverages large language
models (LLMs) to execute user intent across application features. While LLMs
are excellent at understanding user intent expressed as natural language, they
are not sufficient for fulfilling application-specific user intent that
requires more than text-to-text transformations. We therefore introduce the
Office Domain Specific Language (ODSL), a concise, high-level language
specialized for performing actions in and interacting with entities in Office
applications. Semantic Interpreter leverages an Analysis-Retrieval prompt
construction method with LLMs for program synthesis, translating natural
language user utterances to ODSL programs that can be transpiled to application
APIs and then executed. We focus our discussion primarily on a research
exploration for Microsoft PowerPoint.