By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
inkeinspires.cominkeinspires.cominkeinspires.com
Notification Show More
Font ResizerAa
  • Home
  • Breaking News
    Breaking NewsShow More
    Son of Norway’s crown princess charged with rape, sexual assault – National
    June 27, 2025
    Brazil’s outspoken first lady comes under fire, but refuses to stop speaking out
    June 27, 2025
    2 charged with murder after bride shot dead, groom and 13-year-old nephew wounded at wedding party in France
    June 27, 2025
    Political violence is quintessentially American | Donald Trump
    June 27, 2025
    19 Virginia sheriffs endorse Miyares over Democrat Jones in attorney general race
    June 27, 2025
  • Business
    BusinessShow More
    Canara Bank hands over Rs 2,283 cr dividend to Centre amid record profits, joins SBI, BoB in robust payouts
    June 27, 2025
    Foreign stocks are crushing US shares, even with the new record high
    June 27, 2025
    Videos reveal driving issues with Tesla’s robotaxi fleet in Austin
    June 27, 2025
    US stocks hit record high as markets recover from Trump tariff shock
    June 27, 2025
    Renewables leaders parse the damage to their industry as Senate finalizes vote on ‘big beautiful bill’
    June 27, 2025
  • Entertainment
    EntertainmentShow More
    Terminator’s Forgotten First Attempt To Save Itself
    June 27, 2025
    Meghan Markle’s $658 Weekender Tote Look Is $36 on Amazon
    June 27, 2025
    Armed Elderly Woman Blocks Texas Highway In 5-Hour Standoff
    June 27, 2025
    Inside Kevin Spacey’s ‘Substantial’ Hollywood Return
    June 27, 2025
    12 Best Movies Like M3GAN
    June 27, 2025
  • Gadgets
    GadgetsShow More
    CES 2025: 41 Products You Can Buy Right Now
    January 13, 2025
    I can’t wait try out these 3 great plant tech gadgets that I saw at CES 2025
    January 13, 2025
    6 on Your Side Consumer Confidence: Kitchen gadgets to upgrade family recipes – ABC 6 News
    January 13, 2025
    35+ Best New Products, Tech and Gadgets
    January 13, 2025
    These gadgets kept me connected and working through a 90-mile backpacking trip
    January 13, 2025
  • Health
    HealthShow More
    A New Study Finds An 8-Hour Eating Window May Help Burn Fat—But Is It Safe? inkeinspires
    June 27, 2025
    184: Crafting a Morning Routine That Works For YOU inkeinspires
    June 26, 2025
    Endurance Exercise and Longevity – BionicOldGuy inkeinspires
    June 26, 2025
    How Zone 2 Cardio Can Burn Fat And Boost Longevity inkeinspires
    June 26, 2025
    What to do when an exercise doesn’t feel right inkeinspires
    June 25, 2025
  • Sports
    SportsShow More
    Lyon included in Ligue 1 fixtures despite demotion to Ligue 2, and receive Europa League clearance
    June 27, 2025
    Brentford appoint former Wolves midfielder Andrews as boss
    June 27, 2025
    Real Betis still hopeful over ‘very complex’ deal for Manchester United’s Antony
    June 27, 2025
    Sri Lanka ODI squad vs Bangladesh announced, Matheesha Pathirana dropped
    June 27, 2025
    Rohit Sharma reveals the unsung hero behind India’s T20 World Cup 2024 triumph
    June 27, 2025
  • Technology
    TechnologyShow More
    US Supreme Court Upholds Texas Porn ID Law
    June 27, 2025
    SCOTUS porn ruling opens door to sweeping internet age verification
    June 27, 2025
    Early Prime Day deals include our favorite mesh Wi-Fi router for a record-low price
    June 27, 2025
    Best Smart Home Safes for 2025: We Cracked the Code
    June 27, 2025
    Mattress Shopping Terms to Know (2025)
    June 27, 2025
  • Posts
    • Post Layouts
    • Gallery Layouts
    • Video Layouts
    • Audio Layouts
    • Post Sidebar
    • Review
      • User Rating
    • Content Features
    • Table of Contents
  • Contact US
  • Pages
    • Blog Index
    • Search Page
    • Customize Interests
    • My Bookmarks
    • 404 Page
Reading: A look under the hood of transfomers, the engine driving AI model evolution
Share
Font ResizerAa
inkeinspires.cominkeinspires.com
  • Entertainment
Search
  • Home
  • Categories
    • Breaking News
    • Business
    • Sports
    • Technology
    • Entertainment
    • Gadgets
    • Health
  • Contact
Have an existing account? Sign In
Follow US
inkeinspires.com > Technology > A look under the hood of transfomers, the engine driving AI model evolution
Technology

A look under the hood of transfomers, the engine driving AI model evolution

MTHANNACH
Last updated: February 15, 2025 9:03 pm
MTHANNACH Published February 15, 2025
Share
SHARE

Join our daily and weekly newsletters for the latest updates and the exclusive content on AI coverage. Learn more


Today, almost all of peak AI products and models use transformer architecture. The models of large languages ​​(LLM) such as GPT-4O, Llama, Gemini and Claude are all based on transformers, and other AI applications such as text text, automatic speech recognition, generation generation Images and video text models have transformers such as their underlying technology.

With the media threw around AI that can be slowed down as soon as it is, it is time to give transformers their due, that is why I would like to explain a little how they work, why they are so important for the growth of solutions evolving and why they are the backbone of LLMS.

Transformers are more than who meet the eye

In short, a transformer is an architecture of a neural network designed to model data sequences, which makes them ideal for tasks such as language translation, completion of sentences, automatic speech recognition and more. Transformers have really become the dominant architecture for many of these sequence modeling tasks, because the underlying attention mechanism can be easily parallelized, allowing a massive scale during training and inference.

Originally introduced in a 2017 article, “Attention is all you need“Google researchers, the transformer was introduced as an encoder architecture specially designed for the translation of language. The following year, Google published representations of Bidirectional Coder of Transformers (Bert), which could be considered one of the first LLM – although it is now considered small according to today’s standards.

Since then – and above all accelerated with the advent of OPENAI GPT models – the trend has been to form increasingly large models with more data, more parameters and longer context windows.

To facilitate this evolution, there have been many innovations such as: more more advanced GPU equipment and better software for multi-GPU training; techniques such as quantification and mixture of experts (MOE) to reduce memory consumption; New optimizing for training, such as shampoo and Adamw; Techniques for effectively calculating attention, such as flashatting and KV chatter. The trend will probably continue in the foreseeable future.

The importance of self -expectation in transformers

According to the application, a model of transformer follows a encoder architecture. The component of the encoder learns a vector representation of data which can then be used for downstream tasks such as classification and analysis of feelings. The decoder component takes a vector or a latent representation of the text or the image and uses it to generate a new text, which makes it useful for tasks such as the completion of the sentence and the summary. For this reason, many peak familiar models, such as the GPT family, are only a decoder.

Coder models combine the two components, which makes them useful for translation and other sequence sequence tasks. For encoder and decoder architectures, the central component is the layer of attention, because this is what allows a model to keep the context of words that appear much earlier in the text.

Attention presents itself in two flavors: self -attention and cross -attention. Self-tensioning is used to capture relationships between words in the same sequence, while cross attention is used to capture relationships between words through two different sequences. Crossed attention connects the components of the encoder and the decoder in a model and during the translation. For example, it allows the English word “strawberries” to relate to the French word “flora”. Mathematically, self-attentive and cross-attention are different forms of matrix multiplication, which can be made extremely effectively using a GPU.

Due to the layer of attention, transformers can better capture relationships between words separated by long quantities of text, while previous models such as recurring neural networks (RNN) and long -term models in short Term (LSTM) lose track of the context of words earlier in the text.

The future of models

Currently, transformers are the dominant architecture for many use cases that require LLM and benefit from the greatest research and development. Although this does not seem likely to change anytime soon, a different class of model that has recently acquired interest is the models of state space (SSM) such as Mamba. This very effective algorithm can manage very long data sequences, while transformers are limited by a context window.

For me, the most exciting applications of transformer models are multimodal models. The OPENAI GPT -4O, for example, is able to manage text, audio and images – and other suppliers are starting to follow. Multimodal applications are very diverse, ranging from video subtitling to vocal cloning to images segmentation (and more). They also have the opportunity to make AI more accessible to disabled people. For example, a blind person could be greatly served by the ability to interact through vocal and audio components of a multimodal application.

It is an exciting space with a lot of potential to discover new use cases. But remember that, at least in the predictable future, are largely supported by the transformative architecture.

Terrence Alsup is a data scientist greater than Finastra.

DATADECISIONMAKERS

Welcome to the Venturebeat community!

Data data manufacturers are the place where experts, including technicians who do data work, can share data -related information and innovation.

If you want to read on advanced ideas and up-to-date information, best practices and the future of data and data technology, join us at datadecisionmakers.

You could even consider contributing your own article!

Learn more about datadecisionmakers


You Might Also Like

SpaceX Starship spirals out of control in second straight test flight failure

The best early deals already live, dates and everything else you need to know

McDonald’s Snack Wrap: When Is the Viral Food Favorite Returning? June or July?

Nvidia Turns an April Fool’s Joke Into a Real AI Assistant for PC Gaming

Niantic and Capcom will launch update of Monster Hunter Now tied to Monster Hunter Wilds

Share This Article
Facebook X Email Print
Leave a Comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Subscribe to Our Newsletter
Subscribe to our newsletter to get our newest articles instantly!
loader

Email Address*

Name

Follow US

Find US on Social Medias
FacebookLike
XFollow
YoutubeSubscribe
TelegramFollow

Weekly Newsletter

Subscribe to our newsletter to get our newest articles instantly!
[mc4wp_form]
Popular News
Entertainment

Invincible Season 3 Episode 7’s Major Death Is Heartbreaking

MTHANNACH MTHANNACH March 6, 2025
Blue state’s abortion-pill law harms women by depriving follow-up care: pro-life docs
Welcome to the new age of geoeconomics
Federal Reserve faces threat from US consumers’ soaring inflation expectations
Sudan’s military says it has reclaimed presidential palace from rebels
- Advertisement -
Ad imageAd image
Global Coronavirus Cases

Confirmed

0

Death

0

More Information:Covid-19 Statistics

Categories

  • Business
  • Breaking News
  • Entertainment
  • Technology
  • Health
  • Sports
  • Gadgets
We influence 20 million users and is the number one business and technology news network on the planet.
Quick Link
  • My Bookmark
  • InterestsNew
  • Contact Us
  • Blog Index
Top Categories
  • Entertainment

Subscribe US

Subscribe to our newsletter to get our newest articles instantly!

 

All Rights Reserved © Inkinspires 2025
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?