The race for intelligent, human-like AI has intensified with the recent release of DeepSeek’s reasoning model R1, and with larger rivals such as OpenAI and Meta promising better reasoning models of their own.
Large language models (LLMs) have evolved to mimic many aspects of human cognition, including planning, analytical reasoning, language processing, and even creative problem-solving. But challenges remain, particularly around memory and prioritization: LLMs struggle to retain and selectively recall critical information. Let’s look at why that is, and how agentic architecture, or agentic AI, can help address it.
In LLMs, attention mechanisms often force compression of information.
As models scale to handle increasingly complex data, efficiency-oriented attention techniques such as grouped-query attention (GQA) demand that information be condensed. While this compression supports streamlined processing, it also limits a model’s ability to retain and articulate the nuanced insights embedded in its middle layers, which are essential for complex reasoning.
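To make that trade-off concrete, here is a minimal NumPy sketch of grouped-query attention, in which many query heads share a smaller set of key/value heads to shrink the KV cache. The head counts, dimensions, and function name are illustrative assumptions, not the internals of any particular model.

```python
import numpy as np

def grouped_query_attention(q, k, v):
    """Minimal grouped-query attention: many query heads share a smaller set of
    key/value heads, shrinking the KV cache the model must keep around.

    q: (num_q_heads, seq_len, head_dim)
    k, v: (num_kv_heads, seq_len, head_dim)
    """
    num_q_heads, _, head_dim = q.shape
    num_kv_heads = k.shape[0]
    group_size = num_q_heads // num_kv_heads   # query heads per shared KV head

    outputs = []
    for h in range(num_q_heads):
        kv = h // group_size                   # query head h reads the shared KV head kv
        scores = q[h] @ k[kv].T / np.sqrt(head_dim)
        weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
        weights /= weights.sum(axis=-1, keepdims=True)
        outputs.append(weights @ v[kv])
    return np.stack(outputs)                   # (num_q_heads, seq_len, head_dim)

# Toy shapes: 8 query heads share 2 KV heads, a 4x smaller KV cache.
rng = np.random.default_rng(0)
q = rng.normal(size=(8, 16, 64))
k = rng.normal(size=(2, 16, 64))
v = rng.normal(size=(2, 16, 64))
print(grouped_query_attention(q, k, v).shape)  # (8, 16, 64)
```

Storing two key/value heads instead of eight is exactly the kind of condensation described above: it saves memory, but everything the model later needs must survive that squeeze.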
Adding to the challenge is the ‘clustering effect’ of language features that interpretability tools such as sparse autoencoders reveal inside sophisticated LLM architectures.
These autoencoders decompose a model’s internal activations into a large dictionary of sparsely active language features. When visualized as point clouds, these features seem to organize, according to an underlying power law, in a way that resembles the brain’s functional lobes, suggesting specialized pathways like those in human cognition. Emerging studies indicate that this clustering may be influenced by a model’s attention mechanisms, which are likely responsible for creating “hotspots” of information to boost performance.
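For readers unfamiliar with the technique, the toy sparse autoencoder below shows the basic recipe: reconstruct a model’s activations through a wider, sparsely active feature dictionary. The dimensions, initialization, and l1_coeff penalty are illustrative assumptions, not values from any published setup.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy dimensions: d_model internal activations are expanded into a wider
# dictionary of n_features sparsely active "language features".
d_model, n_features = 64, 512
W_enc = rng.normal(scale=0.02, size=(d_model, n_features))
b_enc = np.zeros(n_features)
W_dec = rng.normal(scale=0.02, size=(n_features, d_model))

def sae_forward(x, l1_coeff=1e-3):
    """One forward pass of a sparse autoencoder over a batch of activations x."""
    features = np.maximum(x @ W_enc + b_enc, 0.0)         # ReLU keeps features non-negative
    x_hat = features @ W_dec                               # reconstruct the original activation
    recon_loss = np.mean((x - x_hat) ** 2)
    sparsity_loss = l1_coeff * np.mean(np.abs(features))   # L1 penalty pushes most features to zero
    return features, recon_loss + sparsity_loss

x = rng.normal(size=(8, d_model))                          # a batch of internal activations
features, loss = sae_forward(x)
print(features.shape, round(float(loss), 4))               # (8, 512) and a scalar loss
```

It is the geometry of these learned feature vectors, viewed across many inputs, that appears to cluster into lobe-like regions.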
This efficient organization, however, comes with a trade-off: In compressing and clustering features so densely, LLMs struggle to convey complex, layered reasoning effectively.
Balancing this compression with robust memory architectures and prioritization mechanisms that align more closely with human cognitive strengths could help LLMs retain critical details while making their reasoning pathways more transparent and effective.
Agentic architecture presents an innovative approach by integrating adaptive memory systems with LLMs’ reasoning pathways.
By incorporating a layered memory structure similar to that of humans, an agentic architecture can enhance an LLM’s ability to handle complex tasks through context-sensitive recall and effective prioritization. Unlike traditional LLM-centric architectures, which treat most information uniformly, agentic architectures would allow for memory differentiation. By mimicking human memory, whether episodic (context-specific experiences), semantic (general knowledge), or working memory (temporary, task-specific information), all of which are naturally layered, LLMs could selectively access information based on task relevance, reducing information overload. For example, episodic memory could retain task-specific events for short-term reasoning, semantic memory could handle more general knowledge, and working memory could act as transient storage for the immediate data required during processing. This adaptive memory layering can streamline task handling and improve reasoning quality, allowing LLMs to focus on high-value knowledge and avoid the pitfalls of static, context-limited memory.
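As a rough sketch of what memory differentiation could look like in an agent, the toy class below keeps separate episodic, semantic, and working stores and promotes only the most relevant items into working memory at recall time. The class name, keyword-overlap scoring, and k cutoff are hypothetical placeholders for embedding-based retrieval and learned relevance.

```python
from dataclasses import dataclass, field

@dataclass
class LayeredMemory:
    """Toy layered memory for an agent. Retrieval is naive keyword overlap;
    a real system would use embeddings, recency, and learned relevance."""
    episodic: list = field(default_factory=list)   # context-specific events
    semantic: dict = field(default_factory=dict)   # general knowledge
    working: list = field(default_factory=list)    # transient, per-task scratchpad

    def remember_event(self, event: str) -> None:
        self.episodic.append(event)

    def learn_fact(self, key: str, fact: str) -> None:
        self.semantic[key] = fact

    def recall(self, query: str, k: int = 3) -> list:
        """Pull only the k most relevant items across layers into working memory."""
        def overlap(item: str) -> int:
            return len(set(query.lower().split()) & set(item.lower().split()))
        candidates = self.episodic + list(self.semantic.values())
        self.working = sorted(candidates, key=overlap, reverse=True)[:k]
        return self.working

memory = LayeredMemory()
memory.remember_event("User asked to refactor the billing module yesterday")
memory.learn_fact("billing", "The billing module talks to the payment provider via a webhook queue")
print(memory.recall("refactor the billing module", k=1))
```

The point is less the mechanics than the separation of concerns: only what the current task needs gets promoted into the working set.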
Human cognitive efficiency relies heavily on selective attention and neuromodulation.
In neuromodulation, neurotransmitters like dopamine and acetylcholine adjust neural pathways to prioritize frequently accessed or high-value knowledge. These mechanisms allow the brain to deprioritize less relevant information, optimizing cognitive processing.
In agentic architectures, adaptive weighting mechanisms could function like neuromodulators, enabling LLMs to assign importance to different types of memory based on task relevance. For instance, frequently accessed information could be given priority over lesser-used data, allowing a model to “know what matters” in a given context. This prioritization would help reduce processing strain, streamline memory recall, and enhance the quality and accuracy of responses.
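Such neuromodulation-style weighting could be approximated with a priority score that grows with use and decays with time, as in the illustrative sketch below; the decay constant and reinforcement rule are assumptions chosen only to show the shape of the idea.

```python
import math
import time

class WeightedMemory:
    """Neuromodulation-inspired prioritization sketch: each item's weight grows
    with use and decays with time, so frequently accessed, recently relevant
    knowledge wins recall. The constants are illustrative, not tuned."""

    def __init__(self, decay_rate: float = 0.1):
        self.items = {}
        self.decay_rate = decay_rate

    def store(self, key: str, content: str) -> None:
        self.items[key] = {"content": content, "uses": 0, "last_used": time.time()}

    def _priority(self, meta: dict) -> float:
        age_hours = (time.time() - meta["last_used"]) / 3600
        return (1 + meta["uses"]) * math.exp(-self.decay_rate * age_hours)

    def recall(self, k: int = 1) -> list:
        ranked = sorted(self.items.values(), key=self._priority, reverse=True)
        for meta in ranked[:k]:
            meta["uses"] += 1                  # reinforcement, akin to a dopamine-style boost
            meta["last_used"] = time.time()
        return [meta["content"] for meta in ranked[:k]]

memory = WeightedMemory()
memory.store("deploy", "Deploys go out only after the smoke tests pass")
memory.store("style", "The team prefers short, incremental pull requests")
print(memory.recall())   # repeated recalls keep reinforcing whatever proves useful
```

Items that keep earning recalls accumulate weight while stale ones quietly fade, mirroring how the brain deprioritizes less relevant information.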
Attention is a crucial factor in human expertise, helping us to manage distractions and focus.
In cognitive models, attention works similarly by focusing computational resources on specific aspects of input data, enabling efficient task processing.
Integrating advanced attention functions in LLMs could replicate the focused and predictive reasoning seen in human experts, who often think multiple steps ahead. In human experts, such predictive insight is supported by heightened activity in the brain’s prefrontal cortex, enabling them to anticipate and address potential steps in a problem-solving sequence. By emulating such attention mechanisms in LLMs, agentic architectures could enable models to navigate multi-step tasks with greater foresight and maintain task-specific focus even in complex conditions.
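One simple way to picture task-specific focus is softmax attention over stored context chunks, where the current reasoning step determines which chunks receive most of the weight. The embeddings, temperature, and function name below are illustrative assumptions rather than any production mechanism.

```python
import numpy as np

def focus_weights(context_embeddings, step_embedding, temperature=8.0):
    """Softmax attention over stored context chunks for the current reasoning
    step: relevance scores decide where the model 'spends' its focus."""
    scores = context_embeddings @ step_embedding / temperature
    weights = np.exp(scores - scores.max())
    return weights / weights.sum()

rng = np.random.default_rng(1)
context = rng.normal(size=(5, 32))                 # five stored context chunks
step = context[2] + 0.1 * rng.normal(size=32)      # the current step resembles chunk 2
print(np.round(focus_weights(context, step), 3))   # most of the weight lands on chunk 2
```

Foresight, in this picture, amounts to re-running that weighting at every step so that focus shifts as the plan unfolds.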
Just as human memory retains only critical elements rather than every detail, agentic architectures should aim to help LLMs “remember better” by emphasizing abstraction over retention of dense, comprehensive data points.
An agentic approach would focus on capturing essential patterns and principles rather than exhaustive facts, which could significantly improve task performance.
Such an adaptive memory training system would prioritize retention of high-value knowledge based on usage frequency and contextual relevance. By focusing on remembering better rather than remembering everything, LLMs could dynamically access pertinent information for each task, optimizing reasoning capabilities without overwhelming the model’s memory systems.
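A consolidation pass along these lines might look like the sketch below, which keeps details that earned repeated use and collapses the rest into a short digest. In practice the digest would be written by an LLM; the min_uses threshold and truncated string join here are crude stand-ins for learned relevance and real summarization.

```python
def consolidate(episodic_log, min_uses=2):
    """Toy consolidation pass: keep details that earned repeated use, collapse
    the rest into a short abstract digest."""
    keep = [e["text"] for e in episodic_log if e["uses"] >= min_uses]
    discard = [e["text"] for e in episodic_log if e["uses"] < min_uses]
    digest = ""
    if discard:
        digest = f"{len(discard)} low-value event(s): " + "; ".join(t[:40] for t in discard)
    return keep, digest

log = [
    {"text": "Deploy failed twice on the cache migration", "uses": 3},
    {"text": "Lunch order arrived late on Tuesday", "uses": 0},
    {"text": "User prefers concise answers with citations", "uses": 4},
]
kept, digest = consolidate(log)
print(kept)    # high-value details are retained verbatim
print(digest)  # the rest survive only as an abstract trace
```

What persists is the pattern, such as fragile deploys or a preference for brevity, rather than every raw event.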
Agentic architectures have the potential to transform LLMs into adaptive reasoning agents with human-like cognitive processes.
By integrating layered memory, selective attention, and adaptive recall and focus mechanisms, LLMs could overcome the constraints of extreme compression and context limitations, evolving into intelligent, memory-supported systems capable of nuanced reasoning.