Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
deqing 's Collections
Fourier Language Model
Convergent Evolution
Convergent Evolution (Addition)
Convergent Evolution (Architecture and Optimizer)
Convergent Evolution (Data)

Convergent Evolution (Architecture and Optimizer)

updated 6 days ago
Upvote
-

  • deqing/convergent-llama-300M-muon-original

    Text Generation • 0.3B • Updated 18 days ago • 806

  • deqing/convergent-gdn-300M-muon-original

    Text Generation • 0.3B • Updated 18 days ago • 378

  • deqing/convergent-mamba2-300M-muon-original

    Text Generation • 0.3B • Updated 18 days ago • 380

  • deqing/convergent-lstm-4layer-muon-original

    Text Generation • 0.2B • Updated 18 days ago • 434

  • deqing/convergent-lstm-12layer-muon-original

    Text Generation • 0.2B • Updated 18 days ago • 388

  • deqing/convergent-llama-300M-adamw-original

    Text Generation • 0.3B • Updated 18 days ago • 553

  • deqing/convergent-gdn-300M-adamw-original

    Text Generation • 0.3B • Updated 18 days ago • 388

  • deqing/convergent-mamba2-300M-adamw-original

    Text Generation • 0.3B • Updated 18 days ago • 504
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs