Will transformer derived architectures still be state of the art for language modeling in December 2025?

Metaculus · ★★★☆☆
Yes: 85% (Likely)

Question description #

The transformer architecture was introduced in the landmark 2017 machine learning paper Attention is All You Need. Previously, many researchers believed that the attention mechanism was among the most promising research directions for improving sequence-to-sequence models. Writing in 2015, Christopher Olah remarked,

LSTMs were a big step in what we can accomplish with RNNs. It’s natural to wonder: is there another big step? A common opinion among researchers is: “Yes! There is a next step and it’s attention!”

This prediction turned out to be correct. Transformers are generally considered to have unseated LSTMs in competitive language modeling, and their central operating principle is the attention mechanism.
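
For readers unfamiliar with that mechanism, the sketch below is a minimal NumPy implementation of scaled dot-product attention, the core operation the description refers to. The function name, shapes, and example data are illustrative assumptions, not code from the question, from Metaculus, or from any particular transformer model.

# Minimal sketch of scaled dot-product attention (illustrative only;
# shapes and names are assumptions, not from any specific model).
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Q, K, V: arrays of shape (seq_len, d_k); returns (seq_len, d_k)."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                   # pairwise query-key similarities
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)    # softmax over keys
    return weights @ V                                # attention-weighted sum of values

# Example: 4 tokens with 8-dimensional queries, keys, and values
rng = np.random.default_rng(0)
Q = rng.normal(size=(4, 8))
K = rng.normal(size=(4, 8))
V = rng.normal(size=(4, 8))
print(scaled_dot_product_attention(Q, K, V).shape)   # (4, 8)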

Indicators #

Indicator | Value
Stars | ★★★☆☆
Platform | Metaculus
Number of forecasts | 163

Capture #

Resizable preview:

Will transformer derived architectures still be state of the art for language modeling in December 2025?
85% · Likely
★★★☆☆ · Metaculus · Forecasts: 163
Last updated: 2024-10-07

Embed #

<iframe src="https://metaforecast.org/questions/embed/metaculus-4892" height="600" width="600" frameborder="0"></iframe>
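The height and width attributes can be adjusted to fit the host page; the src URL (…/questions/embed/metaculus-4892) points at the embeddable version of this question.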
