By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
inkeinspires.cominkeinspires.cominkeinspires.com
Notification Show More
Font ResizerAa
  • Home
  • Breaking News
    Breaking NewsShow More
    19 Virginia sheriffs endorse Miyares over Democrat Jones in attorney general race
    June 27, 2025
    China battery giant CATL is expanding globally: Here’s why it matters
    June 27, 2025
    What to know about the Supreme Court birthright citizenship case
    June 27, 2025
    Matcha is having a moment — and it’s putting pressure on Japan’s tea industry
    June 27, 2025
    Americans detained trying to send rice, Bibles, dollar bills to North Korea | Politics News
    June 27, 2025
  • Business
    BusinessShow More
    US stocks hit record high as markets recover from Trump tariff shock
    June 27, 2025
    Renewables leaders parse the damage to their industry as Senate finalizes vote on ‘big beautiful bill’
    June 27, 2025
    ‘India needs its own Big Four’: Grant Thornton Bharat CEO Vishesh Chandiok calls for homegrown audit giants
    June 27, 2025
    Jefferson Capital, existing shareholders raise $150 million in US IPO
    June 27, 2025
    Trump says the U.S. and China have signed a trade deal but doesn’t offer any details
    June 27, 2025
  • Entertainment
    EntertainmentShow More
    Inside Kevin Spacey’s ‘Substantial’ Hollywood Return
    June 27, 2025
    12 Best Movies Like M3GAN
    June 27, 2025
    Paige DeSorbo Refuses to Travel Without This $44 Moisturizer
    June 27, 2025
    The Netflix Hit Comes To An Underwhelming Conclusion
    June 27, 2025
    Brad Pitt’s Los Angeles Home Burglarized During F1 Promotion
    June 27, 2025
  • Gadgets
    GadgetsShow More
    CES 2025: 41 Products You Can Buy Right Now
    January 13, 2025
    I can’t wait try out these 3 great plant tech gadgets that I saw at CES 2025
    January 13, 2025
    6 on Your Side Consumer Confidence: Kitchen gadgets to upgrade family recipes – ABC 6 News
    January 13, 2025
    35+ Best New Products, Tech and Gadgets
    January 13, 2025
    These gadgets kept me connected and working through a 90-mile backpacking trip
    January 13, 2025
  • Health
    HealthShow More
    A New Study Finds An 8-Hour Eating Window May Help Burn Fat—But Is It Safe? inkeinspires
    June 27, 2025
    184: Crafting a Morning Routine That Works For YOU inkeinspires
    June 26, 2025
    Endurance Exercise and Longevity – BionicOldGuy inkeinspires
    June 26, 2025
    How Zone 2 Cardio Can Burn Fat And Boost Longevity inkeinspires
    June 26, 2025
    What to do when an exercise doesn’t feel right inkeinspires
    June 25, 2025
  • Sports
    SportsShow More
    Sri Lanka ODI squad vs Bangladesh announced, Matheesha Pathirana dropped
    June 27, 2025
    Rohit Sharma reveals the unsung hero behind India’s T20 World Cup 2024 triumph
    June 27, 2025
    Keyshawn Davis Under Fire: Fans Blast “Truth Will Reveal Itself” Apology After Missed Weight & Stripped Title
    June 27, 2025
    Unfortunate update on Roman Reigns following WWE return rumours
    June 27, 2025
    Pep Guardiola hails Rodri return after Manchester City midfielder makes first start since September 2024 at Club World Cup | Football News
    June 27, 2025
  • Technology
    TechnologyShow More
    Early Prime Day deals include our favorite mesh Wi-Fi router for a record-low price
    June 27, 2025
    Best Smart Home Safes for 2025: We Cracked the Code
    June 27, 2025
    Mattress Shopping Terms to Know (2025)
    June 27, 2025
    Meta in talks to acquire voice cloning startup Play AI
    June 27, 2025
    Best Internet Providers in Fresno, California
    June 27, 2025
  • Posts
    • Post Layouts
    • Gallery Layouts
    • Video Layouts
    • Audio Layouts
    • Post Sidebar
    • Review
      • User Rating
    • Content Features
    • Table of Contents
  • Contact US
  • Pages
    • Blog Index
    • Search Page
    • Customize Interests
    • My Bookmarks
    • 404 Page
Reading: MiniMax unveils open source LLM with staggering 4M token context
Share
Font ResizerAa
inkeinspires.cominkeinspires.com
  • Entertainment
Search
  • Home
  • Categories
    • Breaking News
    • Business
    • Sports
    • Technology
    • Entertainment
    • Gadgets
    • Health
  • Contact
Have an existing account? Sign In
Follow US
inkeinspires.com > Technology > MiniMax unveils open source LLM with staggering 4M token context
Technology

MiniMax unveils open source LLM with staggering 4M token context

MTHANNACH
Last updated: January 15, 2025 12:00 am
MTHANNACH Published January 15, 2025
Share
SHARE

Join our daily and weekly newsletters for the latest updates and exclusive content covering cutting-edge AI. Learn more


MiniMax is perhaps best known here in the United States today as the Singapore company behind Hailuo, a realistic, high-resolution generative AI video model that rivals Runway, OpenAI’s Sora, and Luma AI’s Dream Machine.

But the company has many more tricks up its sleeve: today, for example, it announced the release and open source of the MiniMax-01 seriesa new family of models designed to handle ultra-long contexts and improve the development of AI agents.

The series includes MiniMax-Text-01, a large base language model (LLM), and MiniMax-VL-01a multimodal visual model.

A massive pop-up

The LLM, MiniMax-Text-o1, is particularly notable for allowing up to 4 million tokens in its pop-up, which is equivalent to a small library full of books. The pop-up window indicates the amount of information the LLM can handle in an input/output exchangewith words and concepts represented as digital “tokens,” the LLM’s own internal mathematical abstraction of the data it has trained on.

And while Google previously led the pack with its Gemini 1.5 Pro model and 2 Million Tokens PopupMiniMax has somehow doubled that figure!

Like MiniMax posted on his official X account today: “MiniMax-01 efficiently processes up to 4 million tokens, which is 20 to 32 times the capacity of other leading models. We believe MiniMax-01 is poised to support the anticipated increase in agent-related applications in the coming year as agents increasingly require extensive context management capabilities and memory sustained.

They are now available for download on Cuddly face And GitHub under a personalized MiniMax licenseso users can try it directly Chat with Hailuo AI (a competitor to ChatGPT/Gemini/Claude), and via MiniMax Application Programming Interface (API)where third-party developers can attach their own unique apps to it.

MiniMax offers text processing and multimodal APIs at competitive prices:

  • $0.2 for 1 million entry tokens
  • $1.1 for 1 million exit tokens

For comparison, OpenAI’s GPT-4o costs $2.50 for 1 million entry tokens thanks to its API, 12.5 times more expensive.

MiniMax has also integrated a Mixture of Experts (MoE) framework with 32 experts to optimize scalability. This design balances compute and memory efficiency while maintaining competitive performance on key tests.

Innovate with Lightning Attention Architecture

At the heart of the MiniMax-01 is the Lightning Attention mechanism, an innovative alternative to the traditional Transformer architecture.

This design significantly reduces computational complexity. The models include 456 billion parameters, of which 45.9 billion are inference-enabled.

Unlike previous architectures, Lightning Attention uses a mix of linear and traditional SoftMax layers, achieving near-linear complexity for long inputs. SoftMaxfor those new to the concept like me, are the transformation of input digits into probabilities totaling 1, so that the LLM can approximate the meaning of the most likely input.

MiniMax has rebuilt its training and inference frameworks to support the Lightning Attention architecture. Key improvements include:

  • Optimizing MoE all-to-all communication: Reduces inter-GPU communication overhead.
  • Be careful, Varlen’s ring: minimizes computer waste for processing long sequences.
  • Efficient kernel implementations: Customized CUDA cores improve Lightning Attention performance.

These advancements make MiniMax-01 models accessible to real-world applications while remaining affordable.

Performance and references

On consumer text and multimodal tests, MiniMax-01 rivals leading models like GPT-4 and Claude-3.5, with particularly strong results on long-context assessments. Notably, MiniMax-Text-01 achieved 100% accuracy on the Needle-In-A-Haystack task with a context of 4 million tokens.

The models also demonstrate minimal performance degradation as input length increases.

MiniMax plans regular updates to expand the models’ capabilities, including code and multimodal improvements.

The company views open source as a step toward creating foundational AI capabilities for the evolving AI agent landscape.

As 2025 promises to be a transformative year for AI agents, the need for durable memory and effective inter-agent communication increases. MiniMax innovations are designed to address these challenges.

Open to collaboration

MiniMax invites developers and researchers to explore the capabilities of MiniMax-01. Beyond open source, its team welcomes technical suggestions and collaboration requests on model@minimaxi.com.

With its commitment to cost-effective and scalable AI, MiniMax is positioned as a key player in shaping the era of AI agents. The MiniMax-01 series offers developers an exciting opportunity to push the boundaries of what long-context AI can achieve.

Daily insights into business use cases with VB Daily

If you want to impress your boss, VB Daily has you covered. We give you insight into what companies are doing with generative AI, from regulatory changes to practical deployments, so you can share insights for maximum ROI.

Read our privacy policy

Thank you for subscribing. Check out more VB newsletters here.

An error has occurred.


You Might Also Like

These researchers used NPR Sunday Puzzle questions to benchmark AI ‘reasoning’ models

Best Internet Providers in Ontario, California

Best Electric Kettles of 2025

Tom Cruise gears up to save us from AI in the latest Mission: Impossible – The Final Reckoning trailer

Gunzilla launches $GUN Web3 gaming token on Binance for Off the Grid

Share This Article
Facebook X Email Print
Leave a Comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Subscribe to Our Newsletter
Subscribe to our newsletter to get our newest articles instantly!
loader

Email Address*

Name

Follow US

Find US on Social Medias
FacebookLike
XFollow
YoutubeSubscribe
TelegramFollow

Weekly Newsletter

Subscribe to our newsletter to get our newest articles instantly!
[mc4wp_form]
Popular News
Business

TikTok shouts out Trump as app goes dark for millions of users across US

MTHANNACH MTHANNACH January 19, 2025
What were the most recent commercial aviation accidents in the US?
Lightspeed’s $2 Billion Anthropic Megadeal Cements VC Firm’s AI Ambitions
Preity Zinta beams with joy after PBKS’ thrilling victory against CSK in IPL 2025
Best Internet Providers in Long Beach, California
- Advertisement -
Ad imageAd image
Global Coronavirus Cases

Confirmed

0

Death

0

More Information:Covid-19 Statistics

Categories

  • Business
  • Breaking News
  • Entertainment
  • Technology
  • Health
  • Sports
  • Gadgets
We influence 20 million users and is the number one business and technology news network on the planet.
Quick Link
  • My Bookmark
  • InterestsNew
  • Contact Us
  • Blog Index
Top Categories
  • Entertainment

Subscribe US

Subscribe to our newsletter to get our newest articles instantly!

 

All Rights Reserved © Inkinspires 2025
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?