After the initial struggles with maintaining context over time, the next leap in AI's memory capabilities came with the Long Short-Term Memory (LSTM) architecture, which brought a new level of sophistication to how machines retain and process information.
Birth of the Long Short-Term Memory
In 1997, Hochreiter and Schmidhuber introduced the Long Short-Term Memory (LSTM) architecture. But to understand its genius, let's first understand the human memory system it mirrors:
Human Memory System                       LSTM Gate
-------------------                       ---------
Attention Filter (what to remember)   →   Input Gate
Working Memory (current state)        →   Memory Cell
Memory Consolidation (what to forget) →   Forget Gate
Memory Retrieval (what to use)        →   Output Gate
The mathematics of the LSTM tells this story of selective memory:
Input Gate:
i_t = σ(W_i[h_(t-1), x_t] + b_i)
Forget Gate:
f_t = σ(W_f[h_(t-1), x_t] + b_f)
Memory Cell:
c_t = f_t ⊙ c_(t-1) + i_t ⊙ tanh(W_c[h_(t-1), x_t] + b_c)
Output Gate:
o_t = σ(W_o[h_(t-1), x_t] + b_o)
Hidden State:
h_t = o_t ⊙ tanh(c_t)
Each equation represents a crucial aspect of conscious memory:
- The Input Gate decides what new information is worth remembering
- The Forget Gate determines what old memories can fade
- The Memory Cell maintains the current state of understanding
- The Output Gate chooses what memories are relevant now, releasing them through the hidden state h_t
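
To see these equations operate together, here is a minimal sketch of a single LSTM step in NumPy. The function name lstm_step, the dictionary layout for the weights, and the shapes are illustrative assumptions for this article, not code from the original 1997 paper:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, W, b):
    """One step of the LSTM equations above.

    W maps each gate ("i", "f", "o", "c") to a (hidden, hidden + input)
    weight matrix; b maps each gate to a (hidden,) bias vector.
    (Hypothetical layout chosen for readability.)
    """
    z = np.concatenate([h_prev, x_t])       # [h_(t-1), x_t]
    i_t = sigmoid(W["i"] @ z + b["i"])      # input gate: what to remember
    f_t = sigmoid(W["f"] @ z + b["f"])      # forget gate: what to let fade
    o_t = sigmoid(W["o"] @ z + b["o"])      # output gate: what to use now
    c_tilde = np.tanh(W["c"] @ z + b["c"])  # candidate new memory content
    c_t = f_t * c_prev + i_t * c_tilde      # memory cell: fade old, admit new
    h_t = o_t * np.tanh(c_t)                # hidden state: the exposed memory
    return h_t, c_t
```

Note how the cell update c_t is a weighted blend: the forget gate scales down old memories while the input gate admits new ones, which is exactly the selective retention the analogy describes.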
The Dance of Memory: How LSTMs Learn
Think of an LSTM as a master storyteller, constantly deciding:
- Which details to emphasize
- Which to let fade
- How to connect distant events
- When to recall earlier information
This mirrors how human consciousness works with memory:
Example: Reading a Mystery Novel
Human Process                 LSTM Process
-------------                 ------------
Note key clues           →    Input Gate activates for important information
Hold suspects in mind    →    Memory Cell maintains key details
Discard red herrings     →    Forget Gate removes irrelevant information
Connect final clues      →    Output Gate retrieves stored information
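
As a hedged illustration of that process, the sketch below runs the lstm_step function from earlier over a whole sequence; random numbers stand in for the novel's pages, and the shapes, seed, and data are assumptions for demonstration only:

```python
# Carry memory across an entire sequence, step by step.
rng = np.random.default_rng(0)
hidden, n_inputs, n_steps = 8, 4, 10

# Small random weights and zero biases, purely for illustration.
W = {g: rng.normal(0.0, 0.1, (hidden, hidden + n_inputs)) for g in "ifoc"}
b = {g: np.zeros(hidden) for g in "ifoc"}

h = np.zeros(hidden)  # working memory starts empty
c = np.zeros(hidden)  # long-term memory cell starts empty
for x_t in rng.normal(size=(n_steps, n_inputs)):
    h, c = lstm_step(x_t, h, c, W, b)  # state persists between steps

print(h.round(3))  # the final hidden state: what the network "recalls"
```

Because h and c are threaded through the loop, a clue seen at step one can still shape the output at step ten, which is precisely the long-range dependence that earlier architectures struggled to maintain.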