Large Action Models (LAMs) are advanced AI systems that can understand language and take actions in both digital and physical environments, surpassing traditional Large Language Models (LLMs) which only generate text. LAMs integrate multiple modalities such as language, vision, and motor control, enabling them to perform tasks autonomously and interact with their surroundings. Their applications range from household robotics to assistive technology and they represent a significant step towards achieving Artificial General Intelligence (AGI).

Uploaded by

mohasinmoosi777
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
17 views13 pages

Lam

Large Action Models (LAMs) are advanced AI systems that can understand language and take actions in both digital and physical environments, surpassing traditional Large Language Models (LLMs) which only generate text. LAMs integrate multiple modalities such as language, vision, and motor control, enabling them to perform tasks autonomously and interact with their surroundings. Their applications range from household robotics to assistive technology and they represent a significant step towards achieving Artificial General Intelligence (AGI).

Uploaded by

mohasinmoosi777
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 13

Technical Seminar

On

LARGE ACTION MODELS (LAMs)

Under the Guidance of:
Prof. Mohammed Siraj B
Assistant Professor
Dept. of AIML

Presented By:
MOHASIN ASIF C K
4DM21AI038
INTRODUCTION
• Large Action Models (LAMs) are advanced AI systems designed not only to
understand language, like Large Language Models (LLMs), but also to take
actions in digital or physical environments.
• LAMs go beyond just talking: they take action.
• While traditional LLMs such as ChatGPT specialize in understanding and
generating text, LAMs are designed to understand instructions and act on
them, physically or digitally.
• LAMs combine language, vision, planning, and action into a unified system.
• A LAM does not just read or write: it can see through cameras or screens,
understand the situation, plan a sequence of steps, and execute them
effectively.
LARGE ACTION MODELS 2
LAMs VS LLMs VS AGENTS
LARGE LANGUAGE MODELS (LLMs)
• LLMs such as ChatGPT, GPT-4, and Bard are trained on
massive text data and excel at understanding
questions, holding conversations, writing content,
summarizing, translating, and more, all in natural
language.
• But they are passive: they only respond with words
and do not take real actions or interact with the
environment.
• Example: Ask an LLM "How do I book a flight?"
and it gives you a step-by-step explanation.

LAMs VS LLMs VS AGENTS
AGENTS
• Agents are task-oriented systems that can use LLMs
internally to understand goals and then perform
multi-step actions, such as searching the web, using
APIs, writing code, or calling functions.
• Agents usually follow a loop of "Think → Plan →
Act" and often use tools or plugins to complete
tasks. They may still need significant supervision and
are often specialized for specific workflows.
• Example: An AI agent could search for flights,
compare prices, and send you a booking link using
a browser plugin or API access.
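The "Think → Plan → Act" loop described above can be sketched in a few lines of Python. Everything here is a hypothetical stand-in: `FlightSearchTool` plays the role of a real browser plugin or API, and the "think" step would normally query an LLM rather than format a string.

```python
class FlightSearchTool:
    """Hypothetical tool (stands in for a browser plugin or flight API)."""
    def run(self, query: str) -> str:
        return f"3 flights found for '{query}'"

def agent_loop(goal: str, tools: dict, max_steps: int = 3) -> list:
    trace = []
    for step in range(max_steps):
        # Think: interpret the goal (a real agent would query an LLM here).
        thought = f"step {step}: need results for '{goal}'"
        # Plan: pick a tool for the current sub-task.
        tool = tools["flight_search"]
        # Act: execute the tool and observe the result.
        observation = tool.run(goal)
        trace.append(observation)
        if "found" in observation:  # goal satisfied, so stop the loop
            break
    return trace

trace = agent_loop("MYS to BLR on Friday",
                   {"flight_search": FlightSearchTool()})
```

The loop terminates as soon as the observation satisfies the goal, which is what distinguishes an agent from an LLM that simply emits one answer.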

LAMs VS LLMs VS AGENTS
LARGE ACTION MODELS (LAMs)
• LAMs take agents a step further. Instead of just
calling tools or APIs, they interact with the
environment directly — whether that’s a web
interface, a 3D game, or a real robot.
• LAMs combine language, vision, memory, reasoning,
and motor control into one unified system. They are
built to handle real-time feedback and can act
autonomously in complex environments.
• Example: A LAM could see a room through a camera,
understand the instruction “put the red cup on the
table,” and physically move a robotic arm to complete
the task.
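The camera-to-arm example can be illustrated with a toy perceive → plan → act cycle. The `RobotArm` stub and the `scene` dictionary are invented for illustration; a real LAM would consume camera frames and drive learned motor policies rather than string matching.

```python
class RobotArm:
    """Toy actuator that just logs the motions it is asked to perform."""
    def __init__(self):
        self.log = []
    def pick(self, obj):
        self.log.append(f"pick {obj}")
    def place(self, obj, target):
        self.log.append(f"place {obj} on {target}")

def run_lam(instruction: str, scene: dict, arm: RobotArm) -> list:
    # Perceive: locate the object and target mentioned in the instruction.
    obj = next(o for o in scene["objects"] if o in instruction)
    target = next(t for t in scene["surfaces"] if t in instruction)
    # Plan: a two-step motion plan.
    plan = [("pick", obj), ("place", obj, target)]
    # Act: execute each step (a real system would verify with feedback).
    for step in plan:
        getattr(arm, step[0])(*step[1:])
    return arm.log

arm = RobotArm()
log = run_lam("put the red cup on the table",
              {"objects": ["red cup"], "surfaces": ["table"]}, arm)
```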

WHY DO WE NEED LAMs?

1. Language Alone Isn't Enough


• LLMs can answer questions, but they can’t take action. In real life, humans don’t just
talk — we do. We open doors, move objects, and solve problems through action.
• LAMs bring that same ability to machines — the power to understand and act.

2. Bridging the Gap Between AI and the Real World


• LAMs allow AI to interact with tools, robots, software, and environments in a
meaningful, goal-driven way.
• Whether it is controlling a drone, navigating a website, or manipulating real objects,
LAMs act as intelligent assistants in the physical and digital world.

REAL WORLD APPLICATIONS
1. Household Robotics: LAMs can power smart home robots that respond to
natural language and perform physical tasks, like "clean the table" or "sort
the laundry."
2. Web Automation and Digital Assistance: LAMs can control software
interfaces, filling out forms, sending emails, creating reports, or booking
tickets, by understanding user intent and navigating digital environments like
a human user.
3. Assistive Technology for Disabled Users: LAMs can enable AI-powered
systems that help users with physical or visual impairments by controlling
devices, reading screens, or physically interacting with objects on their behalf.
4. Autonomous Scientific Research or Lab Assistance: LAMs could be used in
research labs to carry out multi-step procedures, like preparing chemical
samples, running tests, or recording results.
5. Warehouse and Industrial Robotics: In logistics and manufacturing, LAMs
can power robots to move products, sort packages, or assemble parts by
combining spatial awareness, task planning, and precision movement.
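The web-automation application (point 2) boils down to mapping a parsed user intent onto a sequence of UI actions. `PageStub` and its selectors below are hypothetical; a real system would drive an actual browser through an automation layer.

```python
class PageStub:
    """Simulated web page that records fills and a submit click."""
    def __init__(self):
        self.fields = {}
        self.submitted = False
    def fill(self, selector, value):
        self.fields[selector] = value
    def click(self, selector):
        if selector == "#submit":
            self.submitted = True

def automate_form(intent: dict, page: PageStub) -> bool:
    # Map the parsed intent onto concrete UI actions, like a human user would.
    actions = [("fill", "#name", intent["name"]),
               ("fill", "#email", intent["email"]),
               ("click", "#submit")]
    for act in actions:
        getattr(page, act[0])(*act[1:])
    return page.submitted

page = PageStub()
ok = automate_form({"name": "Asha", "email": "asha@example.com"}, page)
```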

ADVANTAGES OF LAMs

1. Multimodal Intelligence: LAMs can understand text, see images or
video, and perceive environments all at once.
2. End-to-End Task Execution: LAMs don't just answer questions;
they complete full tasks from start to finish.
3. Autonomy and Adaptability: LAMs can adjust actions based on real-
time feedback.
4. Reduced Human Effort & Automation: LAMs can automate both
physical labor and digital workflows, saving time and effort.
5. Generalization Across Tasks: Once trained, LAMs can perform
multiple tasks without needing to be reprogrammed.
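Adapting to real-time feedback (advantage 3) is, at its simplest, a closed control loop: act, observe the error, correct, repeat. `GripperSim` below is an invented toy environment whose feedback signal is just the signed position error.

```python
class GripperSim:
    """Toy environment: reports how far a grasp attempt missed the target."""
    def __init__(self, true_pos: float):
        self.true_pos = true_pos
    def try_grasp(self, pos: float) -> float:
        return self.true_pos - pos  # feedback: signed position error

def grasp_with_feedback(env: GripperSim, guess: float, tol: float = 0.01):
    attempts = 0
    while attempts < 10:
        error = env.try_grasp(guess)
        attempts += 1
        if abs(error) <= tol:  # close enough: the grasp succeeds
            return guess, attempts
        guess += error         # correct the next attempt using feedback
    return guess, attempts

pos, n = grasp_with_feedback(GripperSim(true_pos=0.42), guess=0.10)
```

An open-loop system would execute its first guess and stop; the feedback loop is what lets the same code succeed even when the initial estimate is wrong.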

CURRENT LAMS

1. RT-X (by Google DeepMind and partners): "RT" stands for "Robotic
Transformer." RT-X is a family of models trained on data from
22 different robot types, across real and simulated environments.
2. WebArena (by Carnegie Mellon et al.): A simulated web
environment for training and evaluating agents that interact with
websites using natural language.
3. Voyager (Minecraft + GPT-4): Uses GPT-4 to play Minecraft
autonomously. It explores, builds tools, and improves itself over
time.
4. ALOHA (by Stanford): A low-cost bimanual teleoperation system
for robotic arms. Policies trained on its real video and sensor data
allow robots to pour drinks, open drawers, or push objects around
based on human commands.
FUTURE SCOPE OF LARGE ACTION MODELS
(LAMs)
• General-Purpose Robots for Everyday Life: LAMs could power intelligent
household robots capable of performing a wide range of daily tasks, such as
cooking, cleaning, organizing, helping the elderly, or even teaching children.
• Unified Digital and Physical AI Assistants: LAMs may serve as true
personal AI agents that operate seamlessly across your phone, computer, and
smart home, handling your emails, booking appointments, ordering
groceries, or even helping you cook dinner by controlling your smart kitchen
devices.
• High-Impact Fields (Healthcare, Disaster Response, Space): LAMs could
be used in critical environments where human presence is risky, such as
disaster zones, remote healthcare assistance, or even extraterrestrial
exploration.
• Towards Artificial General Intelligence (AGI): LAMs represent a step
toward true generalist AI: systems that can handle any task a human can do,
across both the digital and physical world.
CONCLUSION
Large Action Models (LAMs) represent a major leap in artificial intelligence by enabling
systems that can understand, plan, and act in both the physical and digital world. Unlike
traditional AI that only processes information, LAMs bridge language and action — turning
instructions into real-world outcomes. As research progresses, LAMs will unlock new
possibilities in robotics, personal assistants, science, and more. They mark the beginning of
AI systems that don’t just think — they do.

"In the future, you won’t just talk to AI — you’ll work with it, walk with it, and maybe
even live with it."

REFERENCES

• Tao, Y., Yang, J., Ding, D., & Erickson, Z. (2025). LAMS: LLM-Driven Automatic
Mode Switching for Assistive Teleoperation. arXiv. https://arxiv.org/abs/2501.08558
• Wang, L., Yang, F., Zhang, C., Lu, J., Qian, J., He, S., Zhao, P., Qiao, B., Huang, R.,
Qin, S., Su, Q., Ye, J., Zhang, Y., Lou, J.-G., Lin, Q., Rajmohan, S., Zhang, D., &
Zhang, Q. (2025). Large Action Models: From Inception to Implementation. arXiv.
https://arxiv.org/abs/2412.10047
• Zhou, W., Wu, Z., Li, Y., Zhou, Z., Chen, W., Zhao, J., & Qiu, X. (2024). xLAM: A
Family of Large Action Models to Empower AI Agent Systems. arXiv.
https://arxiv.org/abs/2409.03215

THANK YOU
