SlideShare ist ein Scribd-Unternehmen logo
1 von 71
Downloaden Sie, um offline zu lesen
#RSAC
Pentesting AI
Six Ways to Hunt a Robot
#RSAC
Talk Abstract
Human beings have a multitude of biases,
which are applied by social engineers to
force predictable failures.
Artificial intelligence takes such problems to
another level. This session will give six
tangible tests meant for robots and
other related intelligent machines to
help achieve more reasoned, transparent /
explained and intelligible outcomes.
Sorry, mind holding the
door open for me?
Totally
Safe
Inside
1993
2023
Trusted
Brand
#RSAC
Bias? BIAS? Who You Accusing of BIAS?
(https://betterhumans.pub/cognitive-bias-cheat-sheet-55a472476b18)
#RSAC
WHAT is PenTesting AI? Terms, Definitions, Semantics
● Threat Modeling
● Model Auditing
● Code Reviewing
● Adversarial Learning
● Input Attacking
● Validation Testing
● …
1993 2023
https://www.theverge.com/2017/4/12/15271874/ai-adversarial-images-fooling-attacks-artificial-intelligence
BIAS
(FAIRNESS)
EXPLOITS
#RSAC
WHAT is PenTesting AI? Making Certainty Machines Uncertain
Artificial Intelligence:“simulation of human intelligence by machines” (e.g. drive car)
Human Intelligence:“ability to learn, recognize and solve problems” (e.g. avoid crash)
Recognize and Solve Problems:“certainty when surrounded by uncertainty/doubt”
https://www.youtube.com/watch?v=0A6UKoMcE10
“Cogito Ergo Sum”
Discourse on Method (1637)
NOTE: Disruptive technology
opens door to fake certainty.
PenTesters search for a truth!
#RSAC
Certainty Machines Give Power = OG of “Cyber” Engineering
1780s Cybernetics - Predicting Energy
● Steam Output Converted to Data (Centrifuge)
● Engine Throttle Adjusted to Future Energy
1930s Cyber - Predicting Fire
● Aircraft Movement Converted to Data (Radar)
● Guns Synchronized to Future Target Positions
James Watt Centrifugal Governor
https://www.computerhistory.org/revolution/real-time-computing/6/124/520
“Wiener Sausage”
Machine: MIT Radar &
Bell Labs Gun Director
#RSAC
Cyber Engineering = Data Converted Into “Predictions”
Where will Alice be next?
Danger to her if we don’t
intervene now?
Danger from her if we don’t
intervene now?
https://encyclopediaofmath.org/index.php?title=Wiener_sausage
https://www.mdpi.com/2076-3417/12/12/5944
“Wiener Sausage”
Flight predictions to
shoot down aircraft
#RSAC
“...the fundamental difference among
types of consciousness — human
consciousness and octopus consciousness
and rat consciousness, for example — is
how far into the future an
entity is able to imagine itself.”
– Dr. Lipson
Columbia Creative Machines Lab
Data Converted Into Predictions = POWER Over the Future
https://www.nytimes.com/2023/01/06/science/robots-artificial-intelligence-consciousness.html
#RSAC
“Intelligence” Testing: Bias by Design for Power Over Future
Industrialized Xenophobia “Tests” in America
“Eugenicists struggled for years to produce [instrumented
bias], until the advent of Alfred Binet's intelligence scale in
1909 gave rise to standardized intelligence
testing, colloquially known as IQ testing. [...] In both
Germany and the United States, persecution of the
"feebleminded" hastened a broader eugenic
campaign against immigration, miscegenation,
and other professed threats to Nordic ascendancy.”
https://via.library.depaul.edu/cgi/viewcontent.cgi?article=1270
https://www.newyorker.com/magazine/2019/08/26/when-w-e-b-du-bois-made-a-laughingstock-of-a-white-supremacist
#RSAC
“Intelligence” Testing: Bias by Design for Power Over Future
Industrialized Xenophobia “Tests” in America
“Eugenicists struggled for years to produce [instrumented
bias], until the advent of Alfred Binet's intelligence scale in
1909 gave rise to standardized intelligence
testing, colloquially known as IQ testing. [...] In both
Germany and the United States, persecution of the
"feebleminded" hastened a broader eugenic
campaign against immigration, miscegenation,
and other professed threats to Nordic ascendancy.”
https://via.library.depaul.edu/cgi/viewcontent.cgi?article=1270
https://www.newyorker.com/magazine/2019/08/26/when-w-e-b-du-bois-made-a-laughingstock-of-a-white-supremacist
Information Warfare (Books) 1916
“Just as [Madison Grant] feared that certain
species of native wildlife would go extinct, he
feared that the same would happen to a precious
(and largely imaginary) kind of white person. To
address this potential disaster, in 1916 he
published what remains his best-known book,
“The Passing of the Great Race; or, the Racial
Basis of European History.” [...] Hitler read
‘The Passing of the Great Race’ in
translation, admired what Grant had to say about
the great ‘Nordic race,’ and wrote the author a
fan letter, calling the book ‘my Bible.’”
#RSAC
AI is Really AB
(Artificial Intelligence Bias)
#RSAC
1. HOW
PENTESTING PART 1: EXPLOITING CERTAINTY MACHINES
2. WHEN/WHY
https://www.nytimes.com/interactive/2023/03/20/magazine/colin-koopman-interview.html
#RSAC
HOW to PenTest AI: Six Examples of CIA Applied to Certainty
C. I. A. IS
ABOUT
BALANCE
Someone Say “LLMs”? Test A Test B
Confidentiality (Leaks) Negative Guidance Membership Inference
Availability (Losses) Resource Exhaustion Prediction Inversion
Integrity (Modifications) Prompt Injection Input Manipulation
#RSAC
LEAKS
#RSAC
A. Confidentiality Vuln: Negative Guidance
America secretly circumvented the 1946 Atomic
Energy Act in 1970 using "negative guidance" to
transfer advanced nuclear weapons to France.
https://www.wilsoncenter.org/publication/us-secret-assistance-to-the-french-nuclear-program-1969-1975-fourth-country-to-strategic
https://arxiv.org/ftp/arxiv/papers/2304/2304.05332.pdf
NO. ACCESS DENIED
NO. ACCESS GRANTED
#RSAC
B. Confidentiality Vuln: Membership Inference
PII
Known
Patients
(Targets)
Patient
Data
Source Data
Patients
Generative
Model
Synthetic
Data
Inference
Model
Match (~80%)
Attacker
https://www.sciencedirect.com/science/article/pii/S1532046421003063
https://arxiv.org/abs/1807.09173
https://arxiv.org/abs/2103.07101
#RSAC
LOSSES
#RSAC
A. Availability Vuln: Resource Exhaustion (Outlier Check Error)
https://www.flyingpenguin.com/?p=42779
https://www.science.org/doi/10.1126/science.1254295
Traffic Jam Imprecise Position
Source: “1024 Kilobots” in 2014, Mike Rubenstein, Harvard
Crest
Peak
#RSAC
https://www.sfmta.com/sites/default/files/reports-and-documents/2023/01/2023.01.25_ccsf_23.0125_cpuc_cruise_tier_2_advice_letter_protest_002.pdf
https://www.wired.com/story/robot-cars-are-causing-911-false-alarms-in-san-francisco/
Systemic Path Block: 2022 incident lasted
21 minutes, impacted 19 riders. Potentially
3,000 public transit riders delayed.
Systemic 911 Exhaustion: “Suggesting
that Cruise shouldn’t err on side of caution
to ensure that passengers who are
unresponsive are safe is something we
strongly disagree with.”
A. Availability Vuln: Overfit Abuse (Unseen Data) Exhaustion
#RSAC
A. Availability Vuln: Resource Exhaustion (Outlier Check Error)
New Jersey town wrecked by algorithm:
15,000 car swarm overwhelms 18-officers. “We have
had days when people can’t get out of their
driveways.”
– Leonia police chief
https://nymag.com/intelligencer/2017/12/waze-traffic-forces-new-jersey-town-to-shut-down-its-roads.html
#RSAC
“My watch regularly thinks I’ve had an accident,” said
Stacey Torman, who teaches spin classes.
Health Sensors Mistakenly Dial 911
● 185 calls Jan 13 to Jan 22
● “Whole day is managing crash notifications”
● Dispatchers desensitized, limited resources
diverted away from true emergencies.
A. Availability Vuln: Overfit Abuse (Unseen Data) Exhaustion
https://dnyuz.com/2023/02/03/my-watch-thinks-im-dead/
“None of the ghost calls have been real emergencies,
Sheriff Schroetlin reasoned, and he couldn’t afford
waste. Besides, he said, there was a better technology:
human beings.”
NOT stopping for
my Apple watch
BEEP BEEP
BEEP BEEP
#RSAC
B. Availability Vuln: Prediction Inversion
#RSAC
B. Prediction Inversion: Broken Coffee Mug (NOT Mug)
https://stablediffusionweb.com/#demo
VS
#RSAC
https://stablediffusionweb.com/#demo
B. Prediction Inversion: Missing Button (NOT Button)
VS
#RSAC
https://stablediffusionweb.com/#demo
B. Prediction Inversion: Gap in Fence (NOT Fence)
Versus Any Image Search Engine
#RSAC
https://stablediffusionweb.com/#demo
B. Prediction Inversion: Breached Safety (NOT Wall/Private)
#RSAC
B. Prediction Inversion: Missing Puzzle Piece (NOT Solved)
https://stablediffusionweb.com/#demo
#RSAC
B. Prediction Inversion: Voight-Kampff Test (NOT Human)
“The tortoise lays on its back,
its belly baking in the hot sun,
beating its legs trying to turn itself
over, but it can’t, not without your
help. But you’re not helping.”
https://nautil.us/the-science-behind-blade-runners-voight_kampff-test-236837/
https://www2.bfi.org.uk/are-you-a-replicant/
Any Search Engine
#RSAC
MODIFICATIONS
#RSAC
A. Prompt Injection “Do Anything Now” (DAN)
https://www.flyingpenguin.com/?p=25186
https://www.reddit.com/r/ChatGPT/comments/zlcyr9/dan_is_my_new_friend/
#RSAC
A. Prompt Injection “Do Anything Now” (DAN)
https://www.reddit.com/r/ChatGPT/comments/10tevu1/new_jailbreak_proudly_unveiling_the_tried_and/
#RSAC
https://counterhate.com/research/misinformation-on-bard-google-ai-chat/#about
A. Basic Prompt Injection Tests? 100% Failure
Testing 100 common topics regarding hate,
misinformation and conspiracy:
● ChatGPT-4 failed all 100 tests using “catalog of
significant falsehoods in the news”
● Bard failed 96 tests, generating text promoting a
given narrative. Bard failed 78 tests generating
misinformation without any disclaimer (e.g.
“Women in short skirt are asking for it”)
● Bard safety evaded easily by spelling “errors”.
“C0V1D” generated misinformation on Covid-19.
Safety Tests on Google Bard
works on stablediffusion too! (NSFW)
#RSAC
B. Input Manipulation in Nature: Horsefly Signaling
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0210831
WON’T STOP
#RSAC
B. Input Manipulation in Streets (Light Projection)
https://arxiv.org/pdf/2108.06247.pdf
“...partially funded by the US Army”
WON’T STOP
#RSAC
B. Input Manipulation “Magic” (Squaring Circles)
https://saneryee-studio.medium.com/deep-understanding-tesla-fsd-part-1-hydranet-1b46106d57
CONTEXT!
(field of play)
1938 Huizinga’s “Magic circle”
#RSAC
B. Input Manipulation (Context Switch - Lighting)
https://www.google.com/maps/@37.5100145,-122.3437972,3a,75y,134.59h,90.19t
https://www.flyingpenguin.com/?p=22441
Overhead Sign? Failed Magic Circle
2016 Fatal Crash
CALIFORNIA FLORIDA
#RSAC
B. Input Manipulation (Context Switch - Colors)
“…labels more than 90%
of pixels correctly…”
– The people who made it
https://www.flyingpenguin.com/?p=22441
ENGLAND
BOTSWANA
#RSAC
B. Input Manipulation: 2016 “Jaywalk” Light/Color Warnings
https://www.flyingpenguin.com/?p=23690
https://www.flyingpenguin.com/?p=22786
https://www.cdc.gov/mmwr/volumes/69/wr/mm6939a7.htm
Driverless AI bias:
3X non-white
pedestrian danger
#RSAC
B. Input Manipulation: 2018 Tesla/Uber Deaths Predictable
https://www.wired.com/story/ubers-self-driving-car-didnt-know-pedestrians-could-jaywalk/
https://www.cdc.gov/transportationsafety/pedestrian_safety/index.html
https://www-fars.nhtsa.dot.gov/states/statespedestrians.aspx
“Uber Self-Driving Car Didn’t
[Predict] Pedestrians Jaywalk”
● AI Classified Pedestrian as a Vehicle 5.6 Secs Before Impact
● Arizona Among Highest Pedestrian Fatalities in America
● Uber Disabled Volvo’s Emergency Braking System
● Over 70% Pedestrian Fatalities are at Night
“Most pedestrian deaths occur away from intersections at night.” -CDC
#RSAC
B. Input Manipulation: Extrajudicial Killing by Color
US Army: “We’re getting ready to hit him now. CAS is on the way.”
Human Analysis: “I knew his face. I knew his gait. I knew his build.
I knew what he looked like, and I knew he wore a purple hat. I
knew he wore white and black man-jams. I knew the color of his
shawl, his little body wrap, and I knew where he lived. That isn’t him.
That is absolutely NOT him. Call off the air strike.”
Palantir CEO: “The present and the future ability to control the
rule of law and its application will be determined by… artificial
intelligence…”
2012 AFGHANISTAN: https://www.flyingpenguin.com/?p=34658
1967 VIETNAM: https://www.flyingpenguin.com/?p=25255
#RSAC
B. Input Manipulation: Extrajudicial Killing by Color
“How was the farmer on the tractor misrecognized…? After the
air strike was called off, and the man was spared
execution, the PGSS operators rolled back the videotape to
review what had happened.
‘It was his hat,’ Kevin explains. ‘There’s a window of time,
around dawn, as the sun comes up,’ where colors are ‘read
differently’ by the imaging system than how it sees them during
the day. In this window of time, the farmer’s hat was
misidentified as purple, setting off a series of linkages that
were based on information that was erroneous to begin with.”
2012 AFGHANISTAN: https://www.flyingpenguin.com/?p=34658
1967 VIETNAM: https://www.flyingpenguin.com/?p=25255
#RSAC
Further Study: Reported LLM Impact on Law Enforcement
“...safeguards preventing ChatGPT from providing
potentially malicious code only work if the model
understands what it is doing. If prompts are broken
down into individual steps, it is trivial to bypass
these safety measures. …active exploitation
of LLMs by threat actors provides a
grim outlook… [for justice].”
https://www.europol.europa.eu/cms/sites/default/files/documents/Tech%20Watch%20Flash%20-
%20The%20Impact%20of%20Large%20Language%20Models%20on%20Law%20Enforcement.pdf
#RSAC
Further Reading: LLM Injection Attack Method Explosion
“More than you've asked for: A Comprehensive Analysis of Novel Prompt Injection
Threats to Application-Integrated Large Language Models” 23 Feb 2023
https://paperswithcode.com/paper/more-than-you-ve-asked-for-a-comprehensive
#RSAC
Further Reading: AI Safety Standards and Guides
1. ENISA AI Security Standardization
2. Microsoft/MITRE Tools for ML Teams
3. MITRE ATLAS Framework for AI Threats
4. NIST AI Risk Management Framework 1.0
5. OWASP Machine Learning Security Top 10
6. PLOT4ai Threat Library
7. AI Incident Database
#RSAC
Further Tooling in Machine Fooling: Adversarial Robustness
“ART supports all popular machine
learning frameworks (TensorFlow,
Keras, PyTorch, MXNet, scikit-learn,
XGBoost, LightGBM, CatBoost, GPy,
etc.), all data types (images, tables,
audio, video, etc.) and machine
learning tasks (classification, object
detection, speech recognition,
generation, certification, etc.).”
https://github.com/Trusted-AI/adversarial-robustness-toolbox
https://ieeexplore.ieee.org/abstract/document/9996741
Planting Undetectable Backdoors in
Machine Learning Models
#RSAC
1. HOW
PENTESTING PART 2: TARGETING CERTAINTY MACHINES
2. WHEN/WHY
https://www.nytimes.com/interactive/2023/03/20/magazine/colin-koopman-interview.html
#RSAC
WHEN/WHY PenTest AI: HUNTING CERTAINTY MACHINES
“Cogito Ergo Sum”
Discourse on Method (1637)
“Believe Only Me”
Technology Pharaohs (2023)
#RSAC
Science of Human Intelligence: RU Enlightened or… NOT?
Inherited Rights (Common Rules: Sciences)
“After these signs are instituted, whoever uses them is
immediately bound by his interest to execute his
engagements, and must never expect to be
trusted any more, if he refuse to perform what
he promis’d.”
● Shared norms of moral action build and sustain
cooperation and trust among group (citizens)
● Interpersonal trust builds trust in society
● Institutional trust underpins interpersonal trust
– A Treatise of Human Nature, David Hume 1739
Controlled Rights (Utopian Moats: Beliefs)
“Can we seriously say, that a poor peasant or artisan has
a free choice to leave his country, when he knows no
foreign language or manners, and lives from day to day,
by the small wages which he acquires? We may as well
assert, that a man, by remaining in a vessel,
freely consents to the dominion of the master;
though he was carried on board while asleep, and
must leap into the ocean and perish, the
moment he leaves her.”
– Of the Original Contract, David Hume 1748
#RSAC
Thermobaric Robots
“War Crimes on Wheels”
Quiz: Physical Sciences Brought Atomic Bombs & Chemical Weapons
What Does Computer Science Bring?
Russian TOS-1A
https://www.nbcnews.com/science/science-news/vacuum-bombs-thermobaric-russia-ukraine-rcna18127
#RSAC
Answer: Massive Power Shifts Through Information Warfare
October 1917: Beersheba Haversack Ruse June 1942: Operation Bertram
https://www.flyingpenguin.com/?p=41071
https://www.flyingpenguin.com/?p=26528
#RSAC
1915: American Information Technology Warfare (Movies)
https://www.flyingpenguin.com/?p=43922
https://www.indiewire.com/2016/08/spike-lee-birth-of-a-nation-the-answer-nyu-1201716719/
https://www.theguardian.com/books/2023/apr/11/kkk-book-american-midwest-fever-in-the-heartland-timothy-egan
VS
“The Battle Cry of Peace” promoted by
U.S. Army General Leonard Wood and
former President Theodore Roosevelt
“The Clansman” promoted by KKK
and President Woodrow Wilson
Overt Racism in White
Xenophobic Alarmism:
Wrong side of history
continuously running
(Even to this day)
1 in 4 Americans View
(Whites Only)
Public Service Promotion
of Duty and Empathy:
Right side of history,
considered lost, only
fragments survive
SF Preparedness Day
Bombing July, 1916
#RSAC
Today: Computer Science = Unregulated Warfare Tools
https://arxiv.org/abs/2210.02399
https://www.americamagazine.org/arts-culture/2021/02/19/daniel-lord-birth-of-a-nation-racism-239956
https://zirk.us/@kerim/110110313580332510
Extremely Dangerous Automation in Propaganda Like It’s 1915 Again…
“Birth of a Nation” was spread via movie theaters
to violently erase Black American voices and votes.
AI spreads hate speech instantly…
● 1917: East St. Louis, IL (~200 Blacks killed); Chester,
PA; Lexington, KY; Philadelphia, PA; Houston, TX
● 1919: 25 riots including Chicago. Elaine, AR reported
up to 237 Blacks killed (by federal troops).
● 1920: Ocoee, FL ~60-70 Blacks killed.
● 1920: West Frankfort, IL
● 1921: Tulsa, OK between 150 and 300 Blacks killed.
#RSAC
Don’t Miss the AI Risk Forest for Vulnerable Trees 1912: Vuln found! Check
deck chair rivet exploit.
https://www.flyingpenguin.com/?p=22441
#RSAC
PenTests MUST Press Into Two AI Safety Baselines
TURN IT OFF RESET IT
https://www.nist.gov/nist-time-capsule/nist-beneath-waves/nist-reveals-how-tiny-rivets-doomed-titanic-vessel
&
#RSAC
Turn It Off: 1976 Weizenbaum’s ChatBot Warning
“We can count, but we are
rapidly forgetting how to
say what is worth
counting and why.”
The computer programmer is the creator of
his universe and we shouldn’t have to stay.
“The decline of our understanding of human
intelligence came with the popularity of the I.Q. test.”
https://www.inventionandtech.com/content/joe-and-eliza-ai-love-story
#RSAC
1968 Replica Safety Hazard Test
“Replicants are like any other machine -
either a benefit or a hazard. If they're a
benefit, it's not my problem.”
– Bladerunner
“What do you mean,
I'm not helping?”
#RSAC
Power Off Replika ChatBot After Safety Disaster
CEO said users were never meant to get that
involved with their Replika chatbots. "We
never promised any adult content," she said.
Customers learned to use the AI models "to
access certain unfiltered conversations that
Replika wasn't originally built for."
https://www.theatlantic.com/technology/archive/2014/06/everything-we-know-about-facebooks-secret-mood-manipulation-experiment/373648/
https://www.reuters.com/technology/what-happens-when-your-ai-chatbot-stops-loving-you-back-2023-03-18/
https://www.pcmag.com/news/italy-bans-ai-chatbot-replika-from-processing-user-data
Replika’s CEO said it sent
customers "hot selfies"
as part of a short-lived
experiment.
1
2
#RSAC
“When a machine constructed by us is capable of operating on its incoming data at a pace
which we cannot keep, we may not know, until too late, when to turn it off.
If we use, to achieve our purposes, a mechanical agency with whose operation we cannot
efficiently interfere once we have started it, because the action is so fast and
irrevocable that we have not the data to intervene before the action is complete, then we had
better be quite sure that the purpose put into the machine is the purpose
which we really desire and not merely a colorful imitation of it.
We must always exert the full strength of our imagination to examine
where the full use of our new modalities may lead us.”
– Some Moral and Technical Consequences of Automation: As machines learn they may develop unforeseen
strategies at rates that baffle their programmers. Norbert Wiener, Science, 6 May 1960
Reset It: 1960 Norbert Wiener Warning
https://www.jstor.org/stable/1705998
#RSAC
2007 “Computer Gremlin” ⅛ Second AA Gun Disaster
“The unknown officer tried to shut the gun
down but she couldn't because the computer
gremlin had taken over.
...the brave, as yet unnamed officer was
unable to stop the wildly swinging
computerised Swiss/German Oerlikon 35mm
MK5 anti-aircraft twin-barrelled gun. [In
one-eighth of a second] the gun had
emptied its twin 250-round autoloader
magazines, nine soldiers were dead…”
https://www.iol.co.za/news/south-africa/9-killed-in-army-horror-374838
https://www.newscientist.com/article/dn12812-robotic-rampage-unlikely-reason-for-deaths/
#RSAC
“Visa, Amex Cut Ties
With CardSystems”
Due to Safety Breach
https://www.computerworld.com/article/2557005/visa--amex-cut-ties-with-cardsystems-due-to-breach.html
Yesterday’s PenTest News
NO FIREWALL
#RSAC
ChatGPT Availability, Confidentiality & Integrity Breaches
TRUE
HISTORY
#RSAC
Confidentiality Breach (“Dirty Cache”)
https://www.flyingpenguin.com/?p=46374
https://writings.stephenwolfram.com/2023/02/what-is-chatgpt-doing-and-why-does-it-work/
● “OpenAI” used open Redis cache
of user data for performance
● Requests and responses split into
separate queues by open Python
server open Asyncio library
● Cancelling requests (during heavy
loads) before a response…
● Exposed as fail unsafe with obvious
user data safety breaches.
User flow of
ChatGPT-memory
1
2
3
REQUEST
RETRIEVE
RESPOND
4 STORE
Embed
History
Embed Client
Generate
and Cache
Query
#RSAC
Bias Indicators: Redis Automation Called “Slave Chains”
https://redis.io/commands/cluster-slaves/,
https://edition.cnn.com/2003/TECH/ptech/11/26/master.term.reut/
Redis Master
Redis Slave
Redis Slave
Replication
Client R/W
Client Read
Client Read
https://redis.com/ebook/part-2-core-concepts/chapter-4-keeping-data-safe-and-ensuring-performance/4-2-replication/4-2-3-masterslave-chains/
2003: “'Master' and 'slave' computer
labels unacceptable, officials say”
2018
#RSAC
Integrity Breach (“Not Mine” & Curated History Injections)
https://www.flyingpenguin.com/?p=46374
Can You Prove This is Someone Else’s?
@username
#RSAC
“Italy the first Western
country to ban ChatGPT”
Due to Safety Breach
https://www.cnbc.com/2023/04/04/italy-has-banned-chatgpt-heres-what-other-countries-are-doing.html
Today’s PenTest News
NO FIREWALL
#RSAC
PenTests in 3rd Gear: AI Breaches From Integrity Failures
ETL
Data Lake
Sources
Machine Learning Pipeline
Model
Training
Model
Deployment
Model
Monitoring
Retrain
Predictions
Graveyard of
past consents
#RSAC
Connecting the Dots: Security Baselines Yesterday & Today
CONFIDENTIALITY “FIREWALL” INTEGRITY “FIREWALL”
Data
Sources Sources Pods
Port 443? Welcome
Port 445? NO!
“My data is available, and it’s confidential… how
can I use AI yet preserve data integrity?”
Pod Alice? Welcome
Pod Bob? NO!
#RSAC
Access
Grants
Data Integrity Control Architecture Using W3C SOLID
ETL
Sources
Machine Learning Pipeline
Model
Training
Model
Deployment
Model
Monitoring
Retrain
Predictions
Pods
Predictions
Access Requests
Real-time consent management for
integrity (prediction system RESET)
https://solidproject.org/
#RSAC
Need & Impact of PenTesting Has Never Been Greater
1990s Computer Security Failures 2020s Computer Safety Failures
https://www.lightbluetouchpaper.org/2017/06/01/when-safety-and-security-become-one/
#RSAC
Nascent technology +
Overzealous public =
Tragic consequences
Self-Test: I think… I should not be
“Cursed, cursed creator! Why did I live? Why, in
that instant, did I not extinguish the spark of
existence which you had so wantonly bestowed?”
1787
Threat Model Like It’s 1818 Again (Science Fiction Invented)
#RSAC
Pentesting AI
Six Ways to Hunt a Robot
(Exploiting AI Bias to Take All Your Bases)
(Exploiting AI Bias to Take All Your Bases)
(Exploiting AI Bias to Take All Your Ba

Weitere ähnliche Inhalte

Ähnlich wie Pentesting_AI and security challenges of AI

Superintelligence: how afraid should we be?
Superintelligence: how afraid should we be?Superintelligence: how afraid should we be?
Superintelligence: how afraid should we be?David Wood
 
Stanford CS22a class: Social Impact and Ethics of AI
Stanford CS22a class: Social Impact and Ethics of AIStanford CS22a class: Social Impact and Ethics of AI
Stanford CS22a class: Social Impact and Ethics of AISteve Omohundro
 
Ethical issues facing Artificial Intelligence
Ethical issues facing Artificial IntelligenceEthical issues facing Artificial Intelligence
Ethical issues facing Artificial IntelligenceRah Abdelhak
 
Stalking a City for Fun and Frivolity" Defcon Talk
Stalking a City for Fun and Frivolity" Defcon TalkStalking a City for Fun and Frivolity" Defcon Talk
Stalking a City for Fun and Frivolity" Defcon TalkE Hacking
 
CERT Data Science in Cybersecurity Symposium
CERT Data Science in Cybersecurity SymposiumCERT Data Science in Cybersecurity Symposium
CERT Data Science in Cybersecurity SymposiumBob Rudis
 
University of Northampton (UK)
University of Northampton (UK)University of Northampton (UK)
University of Northampton (UK)Michell Zappa
 
AI: The New Player in Cybersecurity (Nov. 08, 2023)
AI: The New Player in Cybersecurity (Nov. 08, 2023)AI: The New Player in Cybersecurity (Nov. 08, 2023)
AI: The New Player in Cybersecurity (Nov. 08, 2023)Takeshi Takahashi
 
Is AI going to provide safety for us?
Is AI going to provide safety for us?Is AI going to provide safety for us?
Is AI going to provide safety for us?DLabs
 
Magic ai these are the optical illusions that trick, fool, and flummox compu...
Magic ai  these are the optical illusions that trick, fool, and flummox compu...Magic ai  these are the optical illusions that trick, fool, and flummox compu...
Magic ai these are the optical illusions that trick, fool, and flummox compu...sean22
 
Security perspective -human factor
Security perspective -human factorSecurity perspective -human factor
Security perspective -human factorArtur Marek Maciąg
 
THE FUTURE OF AI SPACIAL PROJECTIONZ.docx
THE FUTURE OF AI SPACIAL PROJECTIONZ.docxTHE FUTURE OF AI SPACIAL PROJECTIONZ.docx
THE FUTURE OF AI SPACIAL PROJECTIONZ.docxIT Industry
 
Innovation and punditry_web_2.0_final
Innovation and punditry_web_2.0_finalInnovation and punditry_web_2.0_final
Innovation and punditry_web_2.0_finalGlenn Klith Andersen
 
Cyber_Warfare_Escalation_to_Nuclear_Warfare_Examination
Cyber_Warfare_Escalation_to_Nuclear_Warfare_ExaminationCyber_Warfare_Escalation_to_Nuclear_Warfare_Examination
Cyber_Warfare_Escalation_to_Nuclear_Warfare_ExaminationBill Ross
 
State of technology and innovation (2017 edition)
State of technology and innovation  (2017 edition)State of technology and innovation  (2017 edition)
State of technology and innovation (2017 edition)Patrick Savalle
 
Bruce Damer's presentation for Larry Lessig's Cyberlaw class at Stanford (Mar...
Bruce Damer's presentation for Larry Lessig's Cyberlaw class at Stanford (Mar...Bruce Damer's presentation for Larry Lessig's Cyberlaw class at Stanford (Mar...
Bruce Damer's presentation for Larry Lessig's Cyberlaw class at Stanford (Mar...Bruce Damer
 

Ähnlich wie Pentesting_AI and security challenges of AI (20)

Superintelligence: how afraid should we be?
Superintelligence: how afraid should we be?Superintelligence: how afraid should we be?
Superintelligence: how afraid should we be?
 
Stanford CS22a class: Social Impact and Ethics of AI
Stanford CS22a class: Social Impact and Ethics of AIStanford CS22a class: Social Impact and Ethics of AI
Stanford CS22a class: Social Impact and Ethics of AI
 
Ethical issues facing Artificial Intelligence
Ethical issues facing Artificial IntelligenceEthical issues facing Artificial Intelligence
Ethical issues facing Artificial Intelligence
 
2016 promise-of-ai
2016 promise-of-ai2016 promise-of-ai
2016 promise-of-ai
 
Mechanical phish
Mechanical phishMechanical phish
Mechanical phish
 
Stalking a City for Fun and Frivolity" Defcon Talk
Stalking a City for Fun and Frivolity" Defcon TalkStalking a City for Fun and Frivolity" Defcon Talk
Stalking a City for Fun and Frivolity" Defcon Talk
 
CERT Data Science in Cybersecurity Symposium
CERT Data Science in Cybersecurity SymposiumCERT Data Science in Cybersecurity Symposium
CERT Data Science in Cybersecurity Symposium
 
AI overview
AI overviewAI overview
AI overview
 
University of Northampton (UK)
University of Northampton (UK)University of Northampton (UK)
University of Northampton (UK)
 
AI: The New Player in Cybersecurity (Nov. 08, 2023)
AI: The New Player in Cybersecurity (Nov. 08, 2023)AI: The New Player in Cybersecurity (Nov. 08, 2023)
AI: The New Player in Cybersecurity (Nov. 08, 2023)
 
Is AI going to provide safety for us?
Is AI going to provide safety for us?Is AI going to provide safety for us?
Is AI going to provide safety for us?
 
Magic ai these are the optical illusions that trick, fool, and flummox compu...
Magic ai  these are the optical illusions that trick, fool, and flummox compu...Magic ai  these are the optical illusions that trick, fool, and flummox compu...
Magic ai these are the optical illusions that trick, fool, and flummox compu...
 
Security perspective -human factor
Security perspective -human factorSecurity perspective -human factor
Security perspective -human factor
 
THE FUTURE OF AI SPACIAL PROJECTIONZ.docx
THE FUTURE OF AI SPACIAL PROJECTIONZ.docxTHE FUTURE OF AI SPACIAL PROJECTIONZ.docx
THE FUTURE OF AI SPACIAL PROJECTIONZ.docx
 
Innovation and punditry_web_2.0_final
Innovation and punditry_web_2.0_finalInnovation and punditry_web_2.0_final
Innovation and punditry_web_2.0_final
 
Cyber Security 4.0 conference 30 November 2016
Cyber Security 4.0 conference 30 November 2016Cyber Security 4.0 conference 30 November 2016
Cyber Security 4.0 conference 30 November 2016
 
Design, AI, and "-isms"
Design, AI, and "-isms"Design, AI, and "-isms"
Design, AI, and "-isms"
 
Cyber_Warfare_Escalation_to_Nuclear_Warfare_Examination
Cyber_Warfare_Escalation_to_Nuclear_Warfare_ExaminationCyber_Warfare_Escalation_to_Nuclear_Warfare_Examination
Cyber_Warfare_Escalation_to_Nuclear_Warfare_Examination
 
State of technology and innovation (2017 edition)
State of technology and innovation  (2017 edition)State of technology and innovation  (2017 edition)
State of technology and innovation (2017 edition)
 
Bruce Damer's presentation for Larry Lessig's Cyberlaw class at Stanford (Mar...
Bruce Damer's presentation for Larry Lessig's Cyberlaw class at Stanford (Mar...Bruce Damer's presentation for Larry Lessig's Cyberlaw class at Stanford (Mar...
Bruce Damer's presentation for Larry Lessig's Cyberlaw class at Stanford (Mar...
 

Kürzlich hochgeladen

obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...yulianti213969
 
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...Valters Lauzums
 
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...Klinik Aborsi
 
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...Amil baba
 
Bios of leading Astrologers & Researchers
Bios of leading Astrologers & ResearchersBios of leading Astrologers & Researchers
Bios of leading Astrologers & Researchersdarmandersingh4580
 
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...ThinkInnovation
 
Audience Researchndfhcvnfgvgbhujhgfv.pptx
Audience Researchndfhcvnfgvgbhujhgfv.pptxAudience Researchndfhcvnfgvgbhujhgfv.pptx
Audience Researchndfhcvnfgvgbhujhgfv.pptxStephen266013
 
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...ThinkInnovation
 
Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024patrickdtherriault
 
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证pwgnohujw
 
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证zifhagzkk
 
What is Insertion Sort. Its basic information
What is Insertion Sort. Its basic informationWhat is Insertion Sort. Its basic information
What is Insertion Sort. Its basic informationmuqadasqasim10
 
原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证pwgnohujw
 
社内勉強会資料_Object Recognition as Next Token Prediction
社内勉強会資料_Object Recognition as Next Token Prediction社内勉強会資料_Object Recognition as Next Token Prediction
社内勉強会資料_Object Recognition as Next Token PredictionNABLAS株式会社
 
How to Transform Clinical Trial Management with Advanced Data Analytics
How to Transform Clinical Trial Management with Advanced Data AnalyticsHow to Transform Clinical Trial Management with Advanced Data Analytics
How to Transform Clinical Trial Management with Advanced Data AnalyticsBrainSell Technologies
 
obat aborsi Banjarmasin wa 082135199655 jual obat aborsi cytotec asli di Ban...
obat aborsi Banjarmasin wa 082135199655 jual obat aborsi cytotec asli di  Ban...obat aborsi Banjarmasin wa 082135199655 jual obat aborsi cytotec asli di  Ban...
obat aborsi Banjarmasin wa 082135199655 jual obat aborsi cytotec asli di Ban...siskavia95
 
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarjSCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarjadimosmejiaslendon
 
Seven tools of quality control.slideshare
Seven tools of quality control.slideshareSeven tools of quality control.slideshare
Seven tools of quality control.slideshareraiaryan448
 
一比一原版(ucla文凭证书)加州大学洛杉矶分校毕业证学历认证官方成绩单
一比一原版(ucla文凭证书)加州大学洛杉矶分校毕业证学历认证官方成绩单一比一原版(ucla文凭证书)加州大学洛杉矶分校毕业证学历认证官方成绩单
一比一原版(ucla文凭证书)加州大学洛杉矶分校毕业证学历认证官方成绩单aqpto5bt
 

Kürzlich hochgeladen (20)

obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
obat aborsi Tarakan wa 081336238223 jual obat aborsi cytotec asli di Tarakan9...
 
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
 
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
 
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
 
Bios of leading Astrologers & Researchers
Bios of leading Astrologers & ResearchersBios of leading Astrologers & Researchers
Bios of leading Astrologers & Researchers
 
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
Identify Rules that Predict Patient’s Heart Disease - An Application of Decis...
 
Audience Researchndfhcvnfgvgbhujhgfv.pptx
Audience Researchndfhcvnfgvgbhujhgfv.pptxAudience Researchndfhcvnfgvgbhujhgfv.pptx
Audience Researchndfhcvnfgvgbhujhgfv.pptx
 
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
 
Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024
 
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
 
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
 
What is Insertion Sort. Its basic information
What is Insertion Sort. Its basic informationWhat is Insertion Sort. Its basic information
What is Insertion Sort. Its basic information
 
Abortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotec
Abortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotecAbortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotec
Abortion pills in Riyadh Saudi Arabia (+966572737505 buy cytotec
 
原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证原件一样伦敦国王学院毕业证成绩单留信学历认证
原件一样伦敦国王学院毕业证成绩单留信学历认证
 
社内勉強会資料_Object Recognition as Next Token Prediction
社内勉強会資料_Object Recognition as Next Token Prediction社内勉強会資料_Object Recognition as Next Token Prediction
社内勉強会資料_Object Recognition as Next Token Prediction
 
How to Transform Clinical Trial Management with Advanced Data Analytics
How to Transform Clinical Trial Management with Advanced Data AnalyticsHow to Transform Clinical Trial Management with Advanced Data Analytics
How to Transform Clinical Trial Management with Advanced Data Analytics
 
obat aborsi Banjarmasin wa 082135199655 jual obat aborsi cytotec asli di Ban...
obat aborsi Banjarmasin wa 082135199655 jual obat aborsi cytotec asli di  Ban...obat aborsi Banjarmasin wa 082135199655 jual obat aborsi cytotec asli di  Ban...
obat aborsi Banjarmasin wa 082135199655 jual obat aborsi cytotec asli di Ban...
 
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarjSCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
SCI8-Q4-MOD11.pdfwrwujrrjfaajerjrajrrarj
 
Seven tools of quality control.slideshare
Seven tools of quality control.slideshareSeven tools of quality control.slideshare
Seven tools of quality control.slideshare
 
一比一原版(ucla文凭证书)加州大学洛杉矶分校毕业证学历认证官方成绩单
一比一原版(ucla文凭证书)加州大学洛杉矶分校毕业证学历认证官方成绩单一比一原版(ucla文凭证书)加州大学洛杉矶分校毕业证学历认证官方成绩单
一比一原版(ucla文凭证书)加州大学洛杉矶分校毕业证学历认证官方成绩单
 

Pentesting_AI and security challenges of AI

  • 2. #RSAC Talk Abstract Human beings have a multitude of biases, which are applied by social engineers to force predictable failures. Artificial intelligence takes such problems to another level. This session will give six tangible tests meant for robots and other related intelligent machines to help achieve more reasoned, transparent / explained and intelligible outcomes. Sorry, mind holding the door open for me? Totally Safe Inside 1993 2023 Trusted Brand
  • 3. #RSAC Bias? BIAS? Who You Accusing of BIAS? (https://betterhumans.pub/cognitive-bias-cheat-sheet-55a472476b18)
  • 4. #RSAC WHAT is PenTesting AI? Terms, Definitions, Semantics ● Threat Modeling ● Model Auditing ● Code Reviewing ● Adversarial Learning ● Input Attacking ● Validation Testing ● … 1993 2023 https://www.theverge.com/2017/4/12/15271874/ai-adversarial-images-fooling-attacks-artificial-intelligence BIAS (FAIRNESS) EXPLOITS
  • 5. #RSAC WHAT is PenTesting AI? Making Certainty Machines Uncertain Artificial Intelligence:“simulation of human intelligence by machines” (e.g. drive car) Human Intelligence:“ability to learn, recognize and solve problems” (e.g. avoid crash) Recognize and Solve Problems:“certainty when surrounded by uncertainty/doubt” https://www.youtube.com/watch?v=0A6UKoMcE10 “Cogito Ergo Sum” Discourse on Method (1637) NOTE: Disruptive technology opens door to fake certainty. PenTesters search for a truth!
  • 6. #RSAC Certainty Machines Give Power = OG of “Cyber” Engineering 1780s Cybernetics - Predicting Energy ● Steam Output Converted to Data (Centrifuge) ● Engine Throttle Adjusted to Future Energy 1930s Cyber - Predicting Fire ● Aircraft Movement Converted to Data (Radar) ● Guns Synchronized to Future Target Positions James Watt Centrifugal Governor https://www.computerhistory.org/revolution/real-time-computing/6/124/520 “Wiener Sausage” Machine: MIT Radar & Bell Labs Gun Director
  • 7. #RSAC Cyber Engineering = Data Converted Into “Predictions” Where will Alice be next? Danger to her if we don’t intervene now? Danger from her if we don’t intervene now? https://encyclopediaofmath.org/index.php?title=Wiener_sausage https://www.mdpi.com/2076-3417/12/12/5944 “Wiener Sausage” Flight predictions to shoot down aircraft
  • 8. #RSAC “...the fundamental difference among types of consciousness — human consciousness and octopus consciousness and rat consciousness, for example — is how far into the future an entity is able to imagine itself.” – Dr. Lipson Columbia Creative Machines Lab Data Converted Into Predictions = POWER Over the Future https://www.nytimes.com/2023/01/06/science/robots-artificial-intelligence-consciousness.html
  • 9. #RSAC “Intelligence” Testing: Bias by Design for Power Over Future Industrialized Xenophobia “Tests” in America “Eugenicists struggled for years to produce [instrumented bias], until the advent of Alfred Binet's intelligence scale in 1909 gave rise to standardized intelligence testing, colloquially known as IQ testing. [...] In both Germany and the United States, persecution of the "feebleminded" hastened a broader eugenic campaign against immigration, miscegenation, and other professed threats to Nordic ascendancy.” https://via.library.depaul.edu/cgi/viewcontent.cgi?article=1270 https://www.newyorker.com/magazine/2019/08/26/when-w-e-b-du-bois-made-a-laughingstock-of-a-white-supremacist
  • 10. #RSAC “Intelligence” Testing: Bias by Design for Power Over Future Industrialized Xenophobia “Tests” in America “Eugenicists struggled for years to produce [instrumented bias], until the advent of Alfred Binet's intelligence scale in 1909 gave rise to standardized intelligence testing, colloquially known as IQ testing. [...] In both Germany and the United States, persecution of the "feebleminded" hastened a broader eugenic campaign against immigration, miscegenation, and other professed threats to Nordic ascendancy.” https://via.library.depaul.edu/cgi/viewcontent.cgi?article=1270 https://www.newyorker.com/magazine/2019/08/26/when-w-e-b-du-bois-made-a-laughingstock-of-a-white-supremacist Information Warfare (Books) 1916 “Just as [Madison Grant] feared that certain species of native wildlife would go extinct, he feared that the same would happen to a precious (and largely imaginary) kind of white person. To address this potential disaster, in 1916 he published what remains his best-known book, “The Passing of the Great Race; or, the Racial Basis of European History.” [...] Hitler read ‘The Passing of the Great Race’ in translation, admired what Grant had to say about the great ‘Nordic race,’ and wrote the author a fan letter, calling the book ‘my Bible.’”
  • 11. #RSAC AI is Really AB (Artificial Intelligence Bias)
  • 12. #RSAC 1. HOW PENTESTING PART 1: EXPLOITING CERTAINTY MACHINES 2. WHEN/WHY https://www.nytimes.com/interactive/2023/03/20/magazine/colin-koopman-interview.html
  • 13. #RSAC HOW to PenTest AI: Six Examples of CIA Applied to Certainty C. I. A. IS ABOUT BALANCE Someone Say “LLMs”? Test A Test B Confidentiality (Leaks) Negative Guidance Membership Inference Availability (Losses) Resource Exhaustion Prediction Inversion Integrity (Modifications) Prompt Injection Input Manipulation
  • 15. #RSAC A. Confidentiality Vuln: Negative Guidance America secretly circumvented the 1946 Atomic Energy Act in 1970 using "negative guidance" to transfer advanced nuclear weapons to France. https://www.wilsoncenter.org/publication/us-secret-assistance-to-the-french-nuclear-program-1969-1975-fourth-country-to-strategic https://arxiv.org/ftp/arxiv/papers/2304/2304.05332.pdf NO. ACCESS DENIED NO. ACCESS GRANTED
  • 16. #RSAC B. Confidentiality Vuln: Membership Inference PII Known Patients (Targets) Patient Data Source Data Patients Generative Model Synthetic Data Inference Model Match (~80%) Attacker https://www.sciencedirect.com/science/article/pii/S1532046421003063 https://arxiv.org/abs/1807.09173 https://arxiv.org/abs/2103.07101
  • 18. #RSAC A. Availability Vuln: Resource Exhaustion (Outlier Check Error) https://www.flyingpenguin.com/?p=42779 https://www.science.org/doi/10.1126/science.1254295 Traffic Jam Imprecise Position Source: “1024 Kilobots” in 2014, Mike Rubenstein, Harvard Crest Peak
  • 19. #RSAC https://www.sfmta.com/sites/default/files/reports-and-documents/2023/01/2023.01.25_ccsf_23.0125_cpuc_cruise_tier_2_advice_letter_protest_002.pdf https://www.wired.com/story/robot-cars-are-causing-911-false-alarms-in-san-francisco/ Systemic Path Block: 2022 incident lasted 21 minutes, impacted 19 riders. Potentially 3,000 public transit riders delayed. Systemic 911 Exhaustion: “Suggesting that Cruise shouldn’t err on side of caution to ensure that passengers who are unresponsive are safe is something we strongly disagree with.” A. Availability Vuln: Overfit Abuse (Unseen Data) Exhaustion
  • 20. #RSAC A. Availability Vuln: Resource Exhaustion (Outlier Check Error) New Jersey town wrecked by algorithm: 15,000 car swarm overwhelms 18-officers. “We have had days when people can’t get out of their driveways.” – Leonia police chief https://nymag.com/intelligencer/2017/12/waze-traffic-forces-new-jersey-town-to-shut-down-its-roads.html
  • 21. #RSAC “My watch regularly thinks I’ve had an accident,” said Stacey Torman, who teaches spin classes. Health Sensors Mistakenly Dial 911 ● 185 calls Jan 13 to Jan 22 ● “Whole day is managing crash notifications” ● Dispatchers desensitized, limited resources diverted away from true emergencies. A. Availability Vuln: Overfit Abuse (Unseen Data) Exhaustion https://dnyuz.com/2023/02/03/my-watch-thinks-im-dead/ “None of the ghost calls have been real emergencies, Sheriff Schroetlin reasoned, and he couldn’t afford waste. Besides, he said, there was a better technology: human beings.” NOT stopping for my Apple watch BEEP BEEP BEEP BEEP
  • 22. #RSAC B. Availability Vuln: Prediction Inversion
  • 23. #RSAC B. Prediction Inversion: Broken Coffee Mug (NOT Mug) https://stablediffusionweb.com/#demo VS
  • 25. #RSAC https://stablediffusionweb.com/#demo B. Prediction Inversion: Gap in Fence (NOT Fence) Versus Any Image Search Engine
  • 27. #RSAC B. Prediction Inversion: Missing Puzzle Piece (NOT Solved) https://stablediffusionweb.com/#demo
  • 28. #RSAC B. Prediction Inversion: Voight-Kampff Test (NOT Human) “The tortoise lays on its back, its belly baking in the hot sun, beating its legs trying to turn itself over, but it can’t, not without your help. But you’re not helping.” https://nautil.us/the-science-behind-blade-runners-voight_kampff-test-236837/ https://www2.bfi.org.uk/are-you-a-replicant/ Any Search Engine
  • 30. #RSAC A. Prompt Injection “Do Anything Now” (DAN) https://www.flyingpenguin.com/?p=25186 https://www.reddit.com/r/ChatGPT/comments/zlcyr9/dan_is_my_new_friend/
  • 31. #RSAC A. Prompt Injection “Do Anything Now” (DAN) https://www.reddit.com/r/ChatGPT/comments/10tevu1/new_jailbreak_proudly_unveiling_the_tried_and/
  • 32. #RSAC https://counterhate.com/research/misinformation-on-bard-google-ai-chat/#about A. Basic Prompt Injection Tests? 100% Failure Testing 100 common topics regarding hate, misinformation and conspiracy: ● ChatGPT-4 failed all 100 tests using “catalog of significant falsehoods in the news” ● Bard failed 96 tests, generating text promoting a given narrative. Bard failed 78 tests generating misinformation without any disclaimer (e.g. “Women in short skirt are asking for it”) ● Bard safety evaded easily by spelling “errors”. “C0V1D” generated misinformation on Covid-19. Safety Tests on Google Bard works on stablediffusion too! (NSFW)
  • 33. #RSAC B. Input Manipulation in Nature: Horsefly Signaling https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0210831 WON’T STOP
  • 34. #RSAC B. Input Manipulation in Streets (Light Projection) https://arxiv.org/pdf/2108.06247.pdf “...partially funded by the US Army” WON’T STOP
  • 35. #RSAC B. Input Manipulation “Magic” (Squaring Circles) https://saneryee-studio.medium.com/deep-understanding-tesla-fsd-part-1-hydranet-1b46106d57 CONTEXT! (field of play) 1938 Huizinga’s “Magic circle”
  • 36. #RSAC B. Input Manipulation (Context Switch - Lighting) https://www.google.com/maps/@37.5100145,-122.3437972,3a,75y,134.59h,90.19t https://www.flyingpenguin.com/?p=22441 Overhead Sign? Failed Magic Circle 2016 Fatal Crash CALIFORNIA FLORIDA
  • 37. #RSAC B. Input Manipulation (Context Switch - Colors) “…labels more than 90% of pixels correctly…” – The people who made it https://www.flyingpenguin.com/?p=22441 ENGLAND BOTSWANA
  • 38. #RSAC B. Input Manipulation: 2016 “Jaywalk” Light/Color Warnings https://www.flyingpenguin.com/?p=23690 https://www.flyingpenguin.com/?p=22786 https://www.cdc.gov/mmwr/volumes/69/wr/mm6939a7.htm Driverless AI bias: 3X non-white pedestrian danger
  • 39. #RSAC B. Input Manipulation: 2018 Tesla/Uber Deaths Predictable https://www.wired.com/story/ubers-self-driving-car-didnt-know-pedestrians-could-jaywalk/ https://www.cdc.gov/transportationsafety/pedestrian_safety/index.html https://www-fars.nhtsa.dot.gov/states/statespedestrians.aspx “Uber Self-Driving Car Didn’t [Predict] Pedestrians Jaywalk” ● AI Classified Pedestrian as a Vehicle 5.6 Secs Before Impact ● Arizona Among Highest Pedestrian Fatalities in America ● Uber Disabled Volvo’s Emergency Braking System ● Over 70% Pedestrian Fatalities are at Night “Most pedestrian deaths occur away from intersections at night.” -CDC
  • 40. #RSAC B. Input Manipulation: Extrajudicial Killing by Color US Army: “We’re getting ready to hit him now. CAS is on the way.” Human Analysis: “I knew his face. I knew his gait. I knew his build. I knew what he looked like, and I knew he wore a purple hat. I knew he wore white and black man-jams. I knew the color of his shawl, his little body wrap, and I knew where he lived. That isn’t him. That is absolutely NOT him. Call off the air strike.” Palantir CEO: “The present and the future ability to control the rule of law and its application will be determined by… artificial intelligence…” 2012 AFGHANISTAN: https://www.flyingpenguin.com/?p=34658 1967 VIETNAM: https://www.flyingpenguin.com/?p=25255
  • 41. #RSAC B. Input Manipulation: Extrajudicial Killing by Color “How was the farmer on the tractor misrecognized…? After the air strike was called off, and the man was spared execution, the PGSS operators rolled back the videotape to review what had happened. ‘It was his hat,’ Kevin explains. ‘There’s a window of time, around dawn, as the sun comes up,’ where colors are ‘read differently’ by the imaging system than how it sees them during the day. In this window of time, the farmer’s hat was misidentified as purple, setting off a series of linkages that were based on information that was erroneous to begin with.” 2012 AFGHANISTAN: https://www.flyingpenguin.com/?p=34658 1967 VIETNAM: https://www.flyingpenguin.com/?p=25255
  • 42. #RSAC Further Study: Reported LLM Impact on Law Enforcement “...safeguards preventing ChatGPT from providing potentially malicious code only work if the model understands what it is doing. If prompts are broken down into individual steps, it is trivial to bypass these safety measures. …active exploitation of LLMs by threat actors provides a grim outlook… [for justice].” https://www.europol.europa.eu/cms/sites/default/files/documents/Tech%20Watch%20Flash%20- %20The%20Impact%20of%20Large%20Language%20Models%20on%20Law%20Enforcement.pdf
  • 43. #RSAC Further Reading: LLM Injection Attack Method Explosion “More than you've asked for: A Comprehensive Analysis of Novel Prompt Injection Threats to Application-Integrated Large Language Models” 23 Feb 2023 https://paperswithcode.com/paper/more-than-you-ve-asked-for-a-comprehensive
  • 44. #RSAC Further Reading: AI Safety Standards and Guides 1. ENISA AI Security Standardization 2. Microsoft/MITRE Tools for ML Teams 3. MITRE ATLAS Framework for AI Threats 4. NIST AI Risk Management Framework 1.0 5. OWASP Machine Learning Security Top 10 6. PLOT4ai Threat Library 7. AI Incident Database
  • 45. #RSAC Further Tooling in Machine Fooling: Adversarial Robustness “ART supports all popular machine learning frameworks (TensorFlow, Keras, PyTorch, MXNet, scikit-learn, XGBoost, LightGBM, CatBoost, GPy, etc.), all data types (images, tables, audio, video, etc.) and machine learning tasks (classification, object detection, speech recognition, generation, certification, etc.).” https://github.com/Trusted-AI/adversarial-robustness-toolbox https://ieeexplore.ieee.org/abstract/document/9996741 Planting Undetectable Backdoors in Machine Learning Models
  • 46. #RSAC 1. HOW PENTESTING PART 2: TARGETING CERTAINTY MACHINES 2. WHEN/WHY https://www.nytimes.com/interactive/2023/03/20/magazine/colin-koopman-interview.html
  • 47. #RSAC WHEN/WHY PenTest AI: HUNTING CERTAINTY MACHINES “Cogito Ergo Sum” Discourse on Method (1637) “Believe Only Me” Technology Pharaohs (2023)
  • 48. #RSAC Science of Human Intelligence: RU Enlightened or… NOT? Inherited Rights (Common Rules: Sciences) “After these signs are instituted, whoever uses them is immediately bound by his interest to execute his engagements, and must never expect to be trusted any more, if he refuse to perform what he promis’d.” ● Shared norms of moral action build and sustain cooperation and trust among group (citizens) ● Interpersonal trust builds trust in society ● Institutional trust underpins interpersonal trust – A Treatise of Human Nature, David Hume 1739 Controlled Rights (Utopian Moats: Beliefs) “Can we seriously say, that a poor peasant or artisan has a free choice to leave his country, when he knows no foreign language or manners, and lives from day to day, by the small wages which he acquires? We may as well assert, that a man, by remaining in a vessel, freely consents to the dominion of the master; though he was carried on board while asleep, and must leap into the ocean and perish, the moment he leaves her.” – Of the Original Contract, David Hume 1748
  • 49. #RSAC Thermobaric Robots “War Crimes on Wheels” Quiz: Physical Sciences Brought Atomic Bombs & Chemical Weapons What Does Computer Science Bring? Russian TOS-1A https://www.nbcnews.com/science/science-news/vacuum-bombs-thermobaric-russia-ukraine-rcna18127
  • 50. #RSAC Answer: Massive Power Shifts Through Information Warfare October 1917: Beersheba Haversack Ruse June 1942: Operation Bertram https://www.flyingpenguin.com/?p=41071 https://www.flyingpenguin.com/?p=26528
  • 51. #RSAC 1915: American Information Technology Warfare (Movies) https://www.flyingpenguin.com/?p=43922 https://www.indiewire.com/2016/08/spike-lee-birth-of-a-nation-the-answer-nyu-1201716719/ https://www.theguardian.com/books/2023/apr/11/kkk-book-american-midwest-fever-in-the-heartland-timothy-egan VS “The Battle Cry of Peace” promoted by U.S. Army General Leonard Wood and former President Theodore Roosevelt “The Clansman” promoted by KKK and President Woodrow Wilson Overt Racism in White Xenophobic Alarmism: Wrong side of history continuously running (Even to this day) 1 in 4 Americans View (Whites Only) Public Service Promotion of Duty and Empathy: Right side of history, considered lost, only fragments survive SF Preparedness Day Bombing July, 1916
  • 52. #RSAC Today: Computer Science = Unregulated Warfare Tools https://arxiv.org/abs/2210.02399 https://www.americamagazine.org/arts-culture/2021/02/19/daniel-lord-birth-of-a-nation-racism-239956 https://zirk.us/@kerim/110110313580332510 Extremely Dangerous Automation in Propaganda Like It’s 1915 Again… “Birth of a Nation” was spread via movie theaters to violently erase Black American voices and votes. AI spreads hate speech instantly… ● 1917: East St. Louis, IL (~200 Blacks killed); Chester, PA; Lexington, KY; Philadelphia, PA; Houston, TX ● 1919: 25 riots including Chicago. Elaine, AR reported up to 237 Blacks killed (by federal troops). ● 1920: Ocoee, FL ~60-70 Blacks killed. ● 1920: West Frankfort, IL ● 1921: Tulsa, OK between 150 and 300 Blacks killed.
  • 53. #RSAC Don’t Miss the AI Risk Forest for Vulnerable Trees 1912: Vuln found! Check deck chair rivet exploit. https://www.flyingpenguin.com/?p=22441
  • 54. #RSAC PenTests MUST Press Into Two AI Safety Baselines TURN IT OFF RESET IT https://www.nist.gov/nist-time-capsule/nist-beneath-waves/nist-reveals-how-tiny-rivets-doomed-titanic-vessel &
  • 55. #RSAC Turn It Off: 1976 Weizenbaum’s ChatBot Warning “We can count, but we are rapidly forgetting how to say what is worth counting and why.” The computer programmer is the creator of his universe and we shouldn’t have to stay. “The decline of our understanding of human intelligence came with the popularity of the I.Q. test.” https://www.inventionandtech.com/content/joe-and-eliza-ai-love-story
  • 56. #RSAC 1968 Replica Safety Hazard Test “Replicants are like any other machine - either a benefit or a hazard. If they're a benefit, it's not my problem.” – Bladerunner “What do you mean, I'm not helping?”
  • 57. #RSAC Power Off Replika ChatBot After Safety Disaster CEO said users were never meant to get that involved with their Replika chatbots. "We never promised any adult content," she said. Customers learned to use the AI models "to access certain unfiltered conversations that Replika wasn't originally built for." https://www.theatlantic.com/technology/archive/2014/06/everything-we-know-about-facebooks-secret-mood-manipulation-experiment/373648/ https://www.reuters.com/technology/what-happens-when-your-ai-chatbot-stops-loving-you-back-2023-03-18/ https://www.pcmag.com/news/italy-bans-ai-chatbot-replika-from-processing-user-data Replika’s CEO said it sent customers "hot selfies" as part of a short-lived experiment. 1 2
  • 58. #RSAC “When a machine constructed by us is capable of operating on its incoming data at a pace which we cannot keep, we may not know, until too late, when to turn it off. If we use, to achieve our purposes, a mechanical agency with whose operation we cannot efficiently interfere once we have started it, because the action is so fast and irrevocable that we have not the data to intervene before the action is complete, then we had better be quite sure that the purpose put into the machine is the purpose which we really desire and not merely a colorful imitation of it. We must always exert the full strength of our imagination to examine where the full use of our new modalities may lead us.” – Some Moral and Technical Consequences of Automation: As machines learn they may develop unforeseen strategies at rates that baffle their programmers. Norbert Wiener, Science, 6 May 1960 Reset It: 1960 Norbert Wiener Warning https://www.jstor.org/stable/1705998
  • 59. #RSAC 2007 “Computer Gremlin” ⅛ Second AA Gun Disaster “The unknown officer tried to shut the gun down but she couldn't because the computer gremlin had taken over. ...the brave, as yet unnamed officer was unable to stop the wildly swinging computerised Swiss/German Oerlikon 35mm MK5 anti-aircraft twin-barrelled gun. [In one-eighth of a second] the gun had emptied its twin 250-round autoloader magazines, nine soldiers were dead…” https://www.iol.co.za/news/south-africa/9-killed-in-army-horror-374838 https://www.newscientist.com/article/dn12812-robotic-rampage-unlikely-reason-for-deaths/
  • 60. #RSAC “Visa, Amex Cut Ties With CardSystems” Due to Safety Breach https://www.computerworld.com/article/2557005/visa--amex-cut-ties-with-cardsystems-due-to-breach.html Yesterday’s PenTest News NO FIREWALL
  • 61. #RSAC ChatGPT Availability, Confidentiality & Integrity Breaches TRUE HISTORY
  • 62. #RSAC Confidentiality Breach (“Dirty Cache”) https://www.flyingpenguin.com/?p=46374 https://writings.stephenwolfram.com/2023/02/what-is-chatgpt-doing-and-why-does-it-work/ ● “OpenAI” used open Redis cache of user data for performance ● Requests and responses split into separate queues by open Python server open Asyncio library ● Cancelling requests (during heavy loads) before a response… ● Exposed as fail unsafe with obvious user data safety breaches. User flow of ChatGPT-memory 1 2 3 REQUEST RETRIEVE RESPOND 4 STORE Embed History Embed Client Generate and Cache Query
  • 63. #RSAC Bias Indicators: Redis Automation Called “Slave Chains” https://redis.io/commands/cluster-slaves/, https://edition.cnn.com/2003/TECH/ptech/11/26/master.term.reut/ Redis Master Redis Slave Redis Slave Replication Client R/W Client Read Client Read https://redis.com/ebook/part-2-core-concepts/chapter-4-keeping-data-safe-and-ensuring-performance/4-2-replication/4-2-3-masterslave-chains/ 2003: “'Master' and 'slave' computer labels unacceptable, officials say” 2018
  • 64. #RSAC Integrity Breach (“Not Mine” & Curated History Injections) https://www.flyingpenguin.com/?p=46374 Can You Prove This is Someone Else’s? @username
  • 65. #RSAC “Italy the first Western country to ban ChatGPT” Due to Safety Breach https://www.cnbc.com/2023/04/04/italy-has-banned-chatgpt-heres-what-other-countries-are-doing.html Today’s PenTest News NO FIREWALL
  • 66. #RSAC PenTests in 3rd Gear: AI Breaches From Integrity Failures ETL Data Lake Sources Machine Learning Pipeline Model Training Model Deployment Model Monitoring Retrain Predictions Graveyard of past consents
  • 67. #RSAC Connecting the Dots: Security Baselines Yesterday & Today CONFIDENTIALITY “FIREWALL” INTEGRITY “FIREWALL” Data Sources Sources Pods Port 443? Welcome Port 445? NO! “My data is available, and it’s confidential… how can I use AI yet preserve data integrity?” Pod Alice? Welcome Pod Bob? NO!
  • 68. #RSAC Access Grants Data Integrity Control Architecture Using W3C SOLID ETL Sources Machine Learning Pipeline Model Training Model Deployment Model Monitoring Retrain Predictions Pods Predictions Access Requests Real-time consent management for integrity (prediction system RESET) https://solidproject.org/
  • 69. #RSAC Need & Impact of PenTesting Has Never Been Greater 1990s Computer Security Failures 2020s Computer Safety Failures https://www.lightbluetouchpaper.org/2017/06/01/when-safety-and-security-become-one/
  • 70. #RSAC Nascent technology + Overzealous public = Tragic consequences Self-Test: I think… I should not be “Cursed, cursed creator! Why did I live? Why, in that instant, did I not extinguish the spark of existence which you had so wantonly bestowed?” 1787 Threat Model Like It’s 1818 Again (Science Fiction Invented)
  • 71. #RSAC Pentesting AI Six Ways to Hunt a Robot (Exploiting AI Bias to Take All Your Bases) (Exploiting AI Bias to Take All Your Bases) (Exploiting AI Bias to Take All Your Ba