DeepSeek-V3, Ultra-Large Open-Source AI, Outperforms Llama And Qwen On Launch
Chinese AI startup DeepSeek, known for challenging leading AI vendors with its innovative open-source technologies, today released a new ultra-large model: DeepSeek-V3.
Available via Hugging Face under the company’s license agreement, the new model has 671B parameters but uses a mixture-of-experts architecture that activates only a subset of them for any given task, which lets it handle tasks accurately and efficiently. According to benchmarks shared by DeepSeek, the offering is already topping the charts, outperforming leading …