i'm sorry, but the 1M context window sounds good only on paper in reality, it's more like a 400-500k context window that advertised 1M is the biggest lie i've seen. the model breaks down well before that point -- it doesn't forget, but it starts to malfunction completely
33,31K