Synthetic Query Generation using Large Language Models for Virtual Assistants


This paper was accepted to the Industry Track at SIGIR 2024.

Virtual Assistants (VAs) are important Information Retrieval platforms that help users accomplish various tasks through spoken commands. The speech recognition (speech-to-text) system uses query priors, trained solely on text, to distinguish between phonetically confusable alternatives. Generating synthetic queries that resemble existing VA usage can therefore greatly improve the VA's abilities, especially for use cases that do not (yet) occur in paired audio/text data.
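As a rough illustration of the role of a text-only query prior (this sketch is not from the paper; the query log, hypotheses, and frequency-based scoring are invented for exposition), such a prior can rescore phonetically confusable speech-recognition hypotheses:

```python
from collections import Counter

# Hypothetical log of text-only queries used to estimate a query prior.
query_log = [
    "play wrecking ball by miley cyrus",
    "play wrecking ball by miley cyrus",
    "set a timer for ten minutes",
]

# Simple whole-query prior: P(query) ~ count(query) / total queries seen.
counts = Counter(query_log)
total = sum(counts.values())

def prior_score(query: str) -> float:
    """Relative frequency of the full query in the text-only log (0 if unseen)."""
    return counts[query] / total

# Phonetically confusable hypotheses for the same spoken utterance.
hypotheses = [
    "play wrecking ball by miley cyrus",
    "play recking ball by smiley cyrus",
]

# The prior prefers the hypothesis that looks like real usage.
best = max(hypotheses, key=prior_score)
print(best)  # -> "play wrecking ball by miley cyrus"
```

Synthetic queries extend such a prior to entities and phrasings that have not yet appeared in paired audio/text data.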

In this paper, we provide a preliminary exploration of the use of Large Language Models (LLMs) to generate synthetic queries that are complementary to template-based methods. We investigate (a) whether the methods generate queries that are similar to user queries from a popular VA, and (b) whether the generated queries are specific. We find that, compared to template-based methods, LLMs generate more verbose queries that reference aspects specific to the entity. The generated queries are similar to VA user queries and are specific enough to retrieve the relevant entity. We conclude that queries generated by LLMs and by templates are complementary.
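To make the contrast concrete, the following sketch shows one plausible way the two generation strategies could look side by side. It is an assumption-laden illustration, not the paper's implementation: the templates, the prompt wording, and the `complete` callable standing in for an LLM API are all invented here.

```python
from typing import Callable, List

# Template-based generation: slot the entity name into fixed carrier phrases.
TEMPLATES = [
    "play {entity}",
    "play songs by {entity}",
    "play the latest album by {entity}",
]

def template_queries(entity: str) -> List[str]:
    """Short, uniform queries that reference the entity only by name."""
    return [t.format(entity=entity) for t in TEMPLATES]

# LLM-based generation: prompt a model for queries that may mention
# entity-specific aspects (songs, genre, collaborators, ...).
PROMPT = (
    "Write {n} spoken requests a virtual assistant user might say to hear "
    "music by {entity}. Each request should mention something specific "
    "about the artist. Return one request per line."
)

def llm_queries(entity: str, complete: Callable[[str], str], n: int = 5) -> List[str]:
    """`complete` is any text-completion function; the approach is not tied
    to a particular model or API."""
    response = complete(PROMPT.format(n=n, entity=entity))
    return [line.strip() for line in response.splitlines() if line.strip()]

if __name__ == "__main__":
    print(template_queries("Miley Cyrus"))
```

The template output stays close to the head of existing usage, while the LLM output tends toward longer, more entity-specific phrasings, which is why the two sources complement each other.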
