B
benchmarkist
Article URL: Llama 3.1 405B now runs at 969 tokens/s on Cerebras Inference - Cerebras
Comments URL: Llama 3.1 405B now runs at 969 tokens/s on Cerebras Inference | Hacker News
Points: 415
# Comments: 133
Continue reading...
Comments URL: Llama 3.1 405B now runs at 969 tokens/s on Cerebras Inference | Hacker News
Points: 415
# Comments: 133
Continue reading...