MegaTrain: Full Precision Training of 100B+ Parameter LLMs on a Single GPU

  • Thread starter: chrsw