Llama 3.1 405B now runs at 969 tokens/s on Cerebras Inference

  • Thread starter: benchmarkist