  1. Qwen/Qwen3-4B-Base · Hugging Face

    Qwen3-4B-Base. Qwen3 Highlights: Qwen3 is the latest generation of large language models in the Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models.

  2. Qwen/Qwen3-30B-A3B-Base · Hugging Face

    Qwen3-30B-A3B-Base. Qwen3 Highlights: Qwen3 is the latest generation of large language models in the Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models.

  3. black-forest-labs/FLUX.2-klein-base-9B · Hugging Face

    FLUX.2 [klein] 9B Base is a 9 billion parameter rectified flow transformer that generates images from text descriptions and supports multi-reference editing. It's a full-capacity foundation …

  4. Qwen/Qwen3-8B · Hugging Face

    # Use the endpoint provided by Alibaba Model Studio:
    # 'model_type': 'qwen_dashscope',
    # 'api_key': os.getenv('DASHSCOPE_API_KEY'),
    # Use a custom endpoint compatible with OpenAI API: …

  5. answerdotai/ModernBERT-base · Hugging Face

    Dec 19, 2024 · On GLUE, ModernBERT-base surpasses other similarly-sized encoder models, and ModernBERT-large is second only to DeBERTa-v3-large. For general retrieval tasks, ModernBERT …

  6. unsloth/FLUX.2-klein-base-9B-GGUF · Hugging Face

    FLUX.2 [klein] 9B Base is a 9 billion parameter rectified flow transformer that generates images from text descriptions and supports multi-reference editing. It's a full-capacity foundation …

  7. google-t5/t5-base · Hugging Face

    T5-Base is the checkpoint with 220 million parameters. Developed by: Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, Peter J. Liu.