How does beam search work in language models?

Question

How does beam search work in language models?

1 Answer

rahuljain1 · Answer 1 · 2024-06-25T16:39:18+0000

Beam search represents a popular decoding strategy employed in language generation tasks, such as text generation and machine translation. At its core, beam search operates by iteratively expanding a set of candidate sequences, known as the beam, and selecting the most promising candidates based on a predefined scoring criterion. By exploring multiple potential sequences in parallel and retaining a fixed number of top candidates (the beam width), beam search enables language models to generate coherent and contextually relevant text while balancing between exploration and exploitation.

How does beam search work in language models?

Please log in or register to answer this question.

1 Answer