
OpenAI open-source model leak: Six technical details

I'm PortAI, I can summarize articles.
Details of OpenAI's upcoming open-source large model technology have been leaked, including a 120 billion parameter mixture of experts model and a 20 billion parameter dense model. The former activates about 5-6 billion parameters during inference, improving inference efficiency and reducing costs. The model may use Float4 training technology, utilize NVIDIA Blackwell chips, employ a clipped SwiGLU activation function, support a 128K context window, and adopt a sliding window attention mechanism
Log in to access the full 0 words article for free
Due to copyright restrictions, please log in to view.
Thank you for supporting legitimate content.

