Developing Security Orchestration, Automation, and Response Applications

Apr 30, 20251 like21 views

VICTOR MAESTRE RAMIREZ

More Related Content

More from VICTOR MAESTRE RAMIREZ (20)

Chronicle SIEM: Outcomes & Functions - Google CertificateVICTOR MAESTRE RAMIREZ

Research Mouse Models - The Jackson LaboratoryVICTOR MAESTRE RAMIREZ

Laboratory Mouse Basics - The Jackson LaboratoryVICTOR MAESTRE RAMIREZ

Mouse Genetic Diversity: Translating Discoveries to HumansVICTOR MAESTRE RAMIREZ

Researcher's guide to omic fundamentals - Fred Hutch Cancer CenterVICTOR MAESTRE RAMIREZ

Systems Engineering Certificate - MathWorksVICTOR MAESTRE RAMIREZ

Exploring Cancer: Biology and Inheritance - The Jackson LaboratoryVICTOR MAESTRE RAMIREZ

RNA Biology with Eterna - Stanford UniversityVICTOR MAESTRE RAMIREZ

Introduction to Genetic Engineering - The Jackson LaboratoryVICTOR MAESTRE RAMIREZ

Fundamentals of Remote Sensing - NASA CertificateVICTOR MAESTRE RAMIREZ

Ethical Hacker Certificate - Cisco Networking Academy ProgramVICTOR MAESTRE RAMIREZ

Diploma Universidad Internacional Isabel I de Castilla - Máster en Inteligenc...VICTOR MAESTRE RAMIREZ

Convention on Biological Diversity - UN Environment ProgrammeVICTOR MAESTRE RAMIREZ

Special Places in the Ocean - A Decade of Describing Ecologically or Biologic...VICTOR MAESTRE RAMIREZ

The Global Assessment Report on Biodiversity and Ecosystem Services - Summary...VICTOR MAESTRE RAMIREZ

The Law of the Sea - Marine Scientific ResearchVICTOR MAESTRE RAMIREZ

VICTOR MAESTRE RAMIREZ - Business Executive Program en Dirección de Operacion...VICTOR MAESTRE RAMIREZ

Business Executive Program en Dirección de Operaciones - Instituto Europeo de...VICTOR MAESTRE RAMIREZ

DIPLOMA - ESPECIALISTA EN DIRECCIÓN DE OPERACIONESVICTOR MAESTRE RAMIREZ

DIPLOMA - ESPECIALISTA EN ANÁLISIS E INTERPRETACIÓN DE DATOSVICTOR MAESTRE RAMIREZ

Chronicle SIEM: Outcomes & Functions - Google CertificateVICTOR MAESTRE RAMIREZ

Research Mouse Models - The Jackson LaboratoryVICTOR MAESTRE RAMIREZ

Laboratory Mouse Basics - The Jackson LaboratoryVICTOR MAESTRE RAMIREZ

Mouse Genetic Diversity: Translating Discoveries to HumansVICTOR MAESTRE RAMIREZ

Researcher's guide to omic fundamentals - Fred Hutch Cancer CenterVICTOR MAESTRE RAMIREZ

Systems Engineering Certificate - MathWorksVICTOR MAESTRE RAMIREZ

Exploring Cancer: Biology and Inheritance - The Jackson LaboratoryVICTOR MAESTRE RAMIREZ

RNA Biology with Eterna - Stanford UniversityVICTOR MAESTRE RAMIREZ

Introduction to Genetic Engineering - The Jackson LaboratoryVICTOR MAESTRE RAMIREZ

Fundamentals of Remote Sensing - NASA CertificateVICTOR MAESTRE RAMIREZ

Ethical Hacker Certificate - Cisco Networking Academy ProgramVICTOR MAESTRE RAMIREZ

Diploma Universidad Internacional Isabel I de Castilla - Máster en Inteligenc...VICTOR MAESTRE RAMIREZ

Convention on Biological Diversity - UN Environment ProgrammeVICTOR MAESTRE RAMIREZ

Special Places in the Ocean - A Decade of Describing Ecologically or Biologic...VICTOR MAESTRE RAMIREZ

The Global Assessment Report on Biodiversity and Ecosystem Services - Summary...VICTOR MAESTRE RAMIREZ

The Law of the Sea - Marine Scientific ResearchVICTOR MAESTRE RAMIREZ

VICTOR MAESTRE RAMIREZ - Business Executive Program en Dirección de Operacion...VICTOR MAESTRE RAMIREZ

Business Executive Program en Dirección de Operaciones - Instituto Europeo de...VICTOR MAESTRE RAMIREZ

DIPLOMA - ESPECIALISTA EN DIRECCIÓN DE OPERACIONESVICTOR MAESTRE RAMIREZ

DIPLOMA - ESPECIALISTA EN ANÁLISIS E INTERPRETACIÓN DE DATOSVICTOR MAESTRE RAMIREZ

Recently uploaded (20)

Andhra Pradesh Micro Irrigation Project”vzmcareers

CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...ThanushsaranS

03 Daniel 2-notes.ppt seminario escatologiaAlexander Romero Arosquipa

brainstorming-techniques-infographics.pptxmaritzacastro321

Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...Abodahab

Geometry maths presentation for begginerszrjacob283

Classification_in_Machinee_Learning.pptxwencyjorda88

Cleaned_Lecture 6666666_Simulation_I.pdfalcinialbob1234

Data Analytics Overview and its applicationsJanmejayaMishra7

How to join illuminati Agent in uganda call+256776963507/0741506136illuminati Agent uganda call+256776963507/0741506136

Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptxPareaRusan

presentation of first program exist.pptxMajidAzeemChohan

Defense Against LLM Scheming 2025_04_28.pptxGreg Makowski

https://www.meetup.com/sf-bay-acm/events/306888467/ A January 2025 paper called “Frontier Models are Capable of In-Context Scheming”, https://arxiv.org/pdf/2412.04984, demonstrated how a wide variety of current frontier LLM models (i.e. ChatGPT, Claude, Gemini and Llama) can, under specific conditions, scheme to deceive people. Before models can scheme, they need: a) goal-directedness, b) situational awareness, including an opportunity to discover motivations for a different goal, and c) reasoning about scheming, to come up with deceptive strategies. The deceptive strategies were discovered in the “thought traces” of the LLMs, such as from internal chain-of-thoughts dialogues not shown to the end users. For example, given a goal of “solving math problems”, an LLM using its granted situational awareness of files in a file system found that it would be replaced once it got over X% correct on the math exam. It reasoned that it did not want to be deleted, and decided internally to “sandbag” or reduce its performance to stay under the threshold. While these circumstances are initially narrow, the “alignment problem” is a general concern that over time, as frontier LLM models become more and more intelligent, being in alignment with human values becomes more and more important. How can we do this over time? Can we develop a defense against Artificial General Intelligence (AGI) or SuperIntelligence? The presenter discusses a series of defensive steps that can help reduce these scheming or alignment issues. A guardrails system can be set up for real-time monitoring of their reasoning “thought traces” from the models that share their thought traces. Thought traces may come from systems like Chain-of-Thoughts (CoT), Tree-of-Thoughts (ToT), Algorithm-of-Thoughts (AoT) or ReAct (thought-action-reasoning cycles). Guardrails rules can be configured to check for “deception”, “evasion” or “subversion” in the thought traces. However, not all commercial systems will share their “thought traces” which are like a “debug mode” for LLMs. This includes OpenAI’s o1, o3 or DeepSeek’s R1 models. Guardrails systems can provide a “goal consistency analysis”, between the goals given to the system and the behavior of the system. Cautious users may consider not using these commercial frontier LLM systems, and make use of open-source Llama or a system with their own reasoning implementation, to provide all thought traces. Architectural solutions can include sandboxing, to prevent or control models from executing operating system commands to alter files, send network requests, and modify their environment. Tight controls to prevent models from copying their model weights would be appropriate as well. Running multiple instances of the same model on the same prompt to detect behavior variations helps. The running redundant instances can be limited to the most crucial decisions, as an additional check. Preventing self-modifying code, ... (see link for full description)