SlideShare ist ein Scribd-Unternehmen logo
1 von 68
Downloaden Sie, um offline zu lesen
C280, Computer Vision Prof. Trevor Darrell trevor@eecs.berkeley.edu
Today Administrivia “What is vision?” Schedule Introductions Image formation
Prerequisites This course is appropriate as a first course for graduate students with a EECS background, which should have prepared the students with these essential prerequisites: Data structures A good working knowledge of MATLAB programming (or willingness and time to pick it up quickly!)  Linear algebra  Vector calculus The course does not assume prior imaging experience, computer vision, image processing, or graphics
Text The primary course text will be Rick Szeliski’s draft Computer Vision: Algorithms and Applications; we will use an online copy of the June 19thdraft.    The secondary  text is Forsyth and Ponce, Computer Vision: A Modern Approach.
Primary Text
Secondary Text
Matlab Problem sets and projects will involve Matlab programming (you are free to use alternative packages). Matlab runs on all the Instructional Windows and UNIX systems. Instructions and toolkits are described in http://inst.eecs.berkeley.edu/cgi-bin/pub.cgi?file=matlab.help.   CS280 students can use their existing EECS Windows accounts in EECS instructional labs, and they can request new accounts (for non-majors) or additional access to Instructional resources by following the instructions about ’named’ accounts in http://inst.eecs.berkeley.edu/connecting.html#accounts. They can logon remotely and run it on some of our servers: http://inst.eecs.berkeley.edu/connecting.html#labs
Grading There will be three equal components to the course grade Five  problem sets A take-home exam Final project (including evaluation of proposal document, in-class presentation, and final report) In addition, strong class participation can offset negative performance in any one of the above components.
Problem sets Pset0 – Basic Image Manipulation in Matlab Pset1 – Filtering and Features Pset2 – Geometry and Calibration Pset3 – Recognition  Pset4 – Stereo and Motion Can discuss, but must submit individual work
Take-home Limited time: 3 days Covers everything through hand out date Little programming  *No discussion or collaboration allowed*
Final project Significant novel implementation of technique related to course content Teams of 2 encouraged (document role!) Or journal length review article (no teams) Three components: proposal document (no more than 5 pages) in class results presentation (10 minutes) final write-up (no more than 15 pages)
Class Participation Class participation includes showing up being able to articulate key points from last lecture having read assigned sections and being able to “fill in the blank” during the lecture  I won’t cold call, but will solicit volunteers Strong in-class participation can offset poor performance in one of the other grade components.
Course goals….(broadly speaking) principles of image formation convolution and image pyramids local feature analysis multi-view geometry image warping and stitching structure from motion visual recognition image-based rendering
What is computer vision? 		 		       Done?
What is computer vision? Automatic understanding of images and video Computing properties of the 3D world from visual data (measurement) Algorithms and representations to allow a machine to recognize objects, people, scenes, and activities. (perception and interpretation)
Vision for measurement Multi-view stereo forcommunity photo collections Real-time stereo Structure from motion NASA Mars Rover Pollefeys et al. Goesele et al. Slide credit: L. Lazebnik
Vision for perception, interpretation Objects Activities Scenes Locations Text / writing Faces Gestures Motions Emotions… amusement park sky The Wicked Twister Cedar Point Ferris wheel ride ride 12 E water Lake Erie ride tree tree people waiting in line people sitting on ride umbrellas tree maxair carousel deck bench tree pedestrians
Related disciplines Graphics Algorithms Artificial intelligence Image processing Cognitive science Machine learning Computer vision
Vision and graphics Model Images Vision Graphics Inverse problems: analysis and synthesis.
Why vision? As image sources multiply, so do applications Relieve humans of boring, easy tasks Enhance human abilities: human-computer interaction, visualization Perception for robotics / autonomous agents Organize and give access to visual content
Why vision? Personal photo albums Movies, news, sports Medical and scientific images Surveillance and security Images and video are everywhere! Slide credit; L. Lazebnik
Again, what is computer vision? Mathematics of geometry of image formation? Statistics of the natural world? Models for neuroscience? Engineering methods for matching images? Science Fiction?
Vision Demo? we’re not quite there yet…. Terminator 2
Every picture tells a story Goal of computer vision is to write computer programs that can interpret images
Can computers match (or beat) human vision? Yes and no (but mostly no!) humans are much better at “hard” things computers can be better at “easy” things
Human perception has its shortcomings… Sinha and Poggio, Nature, 1996
Copyright A.Kitaoka 2003
Current state of the art The next slides show some examples of what current vision systems can do
Earth viewers (3D modeling) Image from Microsoft’s Virtual Earth (see also:  Google Earth)
Photosynth http://labs.live.com/photosynth/ Based on Photo Tourism technologydeveloped by Noah Snavely, Steve Seitz, and Rick Szeliski
Photo Tourism overview Photo Explorer Scene reconstruction Input photographs Relative camera positions and orientations Point cloud Sparse correspondence System for interactive browsing and exploring large collections of photos of a scene. Computes viewpoint of each photo as well as a sparse 3d model of the scene.
Photo Tourism overview
Optical character recognition (OCR) Technology to convert scanned docs to text ,[object Object],Digit recognition, AT&T labs http://www.research.att.com/~yann/ License plate readers http://en.wikipedia.org/wiki/Automatic_number_plate_recognition
Face detection Many new digital cameras now detect faces Canon, Sony, Fuji, …
Smile detection? Sony Cyber-shot® T70 Digital Still Camera
Object recognition (in supermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching for items. When an item is detected and recognized, the cashier verifies the quantity of items that were found under the basket, and continues to close the transaction. The item can remain under the basket, and with LaneHawk,you are assured to get paid for it… “
Face recognition Who is she?
Vision-based biometrics “How the Afghan Girl was Identified by Her Iris Patterns”  Read the story
Login without a password… Face recognition systems now beginning to appear more widelyhttp://www.sensiblevision.com/ Fingerprint scanners on many new laptops, other devices
Object recognition (in mobile phones) This is becoming real:                       Microsoft Research Point & Find, Nokia SnapTell.com (now amazon)
Snaptell http://snaptell.com/demos/DemoLarge.htm
Nokia Point and Tell… http://conversations.nokia.com/home/2008/09/point-and-fin-1.html
Special effects:  shape capture The Matrix movies, ESC Entertainment, XYZRGB, NRC
Special effects:  motion capture Pirates of the Carribean, Industrial Light and Magic Click here for interactive demo
Sports Sportvision first down line Nice explanation on www.howstuffworks.com
Smart cars Mobileye Vision systems currently in high-end BMW, GM, Volvo models  By 2010:  70% of car manufacturers. Video demo Slide content courtesy of AmnonShashua
Smart cars Mobileye Vision systems currently in high-end BMW, GM, Volvo models  By 2010:  70% of car manufacturers. Video demo Slide content courtesy of AmnonShashua
Vision-based interaction (and games) Digimask:  put your face on a 3D avatar. Nintendo Wii has camera-based IRtracking built in.  See Lee’s work atCMU on clever tricks on using it tocreate a multi-touch display!  “Game turns moviegoers into Human Joysticks”, CNET Camera tracking a crowd, based on this work.
Vision in space NASA'S Mars Exploration Rover Spirit captured this westward view from atop a low plateau where Spirit spent the closing months of 2007.  Vision systems (JPL) used for several tasks ,[object Object]
3D terrain modeling
Obstacle detection, position tracking
For more, read “Computer Vision on Mars” by Matthies et al.,[object Object]
Medical imaging Image guided surgery Grimson et al., MIT 3D imaging MRI, CT
Current state of the art You just saw examples of current systems. Many of these are less than 5 years old This is a very active research area, and rapidly changing Many new apps in the next 5 years To learn more about vision applications and companies David Lowe maintains an excellent overview of vision companies http://www.cs.ubc.ca/spider/lowe/vision.html
Syllabus / Schedule (see handout) http://tinyurl.com/UCBC280CAL Image Formation Color Image Filtering Pyramids & Regularization Feature Detection and Matching Geometric Alignment Geometric Image Stitching Photometric Image Stitching Recognition ,[object Object]
Optic Flow
Dense Motion Models
Shape from Silhouettes
Shape from Shading and Texture
Surface Models
Segmentation
SFM
IBR & HDR…,[object Object]
Let’s get started: Image formation How are objects in the world captured in an image?
Physical parameters of image formation Geometric Type of projection Camera pose Optical Sensor’s lens type focal length, field of view, aperture Photometric Type, direction, intensity of light reaching sensor Surfaces’ reflectance properties
Image formation ,[object Object]
Idea 1:  put a piece of film in front of an object
Do we get a reasonable image?Slide by Steve Seitz

Weitere ähnliche Inhalte

Was ist angesagt?

What is artificial intelligence
What is artificial intelligenceWhat is artificial intelligence
What is artificial intelligenceSulbha Gath
 
Basic questions about artificial intelligence
Basic questions about artificial intelligenceBasic questions about artificial intelligence
Basic questions about artificial intelligenceAqib Memon
 
Rp 3 published
Rp  3 publishedRp  3 published
Rp 3 publishedAman Jain
 
Selected topics in Computer Science
Selected topics in Computer Science Selected topics in Computer Science
Selected topics in Computer Science Melaku Bayih Demessie
 
Activity recognition for video surveillance
Activity recognition for video surveillanceActivity recognition for video surveillance
Activity recognition for video surveillancevariable_orr
 
Fundamental Questions - The Second Decade of AI: Towards Architectures for Hu...
Fundamental Questions - The Second Decade of AI: Towards Architectures for Hu...Fundamental Questions - The Second Decade of AI: Towards Architectures for Hu...
Fundamental Questions - The Second Decade of AI: Towards Architectures for Hu...Aaron Sloman
 
IRJET- ATM Security using Machine Learning
IRJET- ATM Security using Machine LearningIRJET- ATM Security using Machine Learning
IRJET- ATM Security using Machine LearningIRJET Journal
 
Practical computer vision-- A problem-driven approach towards learning CV/ML/DL
Practical computer vision-- A problem-driven approach towards learning CV/ML/DLPractical computer vision-- A problem-driven approach towards learning CV/ML/DL
Practical computer vision-- A problem-driven approach towards learning CV/ML/DLAlbert Y. C. Chen
 
The Future of Machine Learning
The Future of Machine LearningThe Future of Machine Learning
The Future of Machine LearningRussell Miles
 
SCONUL Summer Conference 2018 - Nicole coleman
SCONUL Summer Conference 2018 - Nicole colemanSCONUL Summer Conference 2018 - Nicole coleman
SCONUL Summer Conference 2018 - Nicole colemansconul
 
Man vs machine consolidated
Man vs machine consolidatedMan vs machine consolidated
Man vs machine consolidatedSriram Venkatram
 
Elements of AI Luxembourg - session 2
Elements of AI Luxembourg - session 2Elements of AI Luxembourg - session 2
Elements of AI Luxembourg - session 2Jeremie Dauphin
 
Artificial Intelligence Research Topics for PhD Manuscripts 2021 - Phdassistance
Artificial Intelligence Research Topics for PhD Manuscripts 2021 - PhdassistanceArtificial Intelligence Research Topics for PhD Manuscripts 2021 - Phdassistance
Artificial Intelligence Research Topics for PhD Manuscripts 2021 - PhdassistancePhD Assistance
 
Building Usable AR Interfaces
Building Usable AR InterfacesBuilding Usable AR Interfaces
Building Usable AR InterfacesMark Billinghurst
 

Was ist angesagt? (20)

What is artificial intelligence
What is artificial intelligenceWhat is artificial intelligence
What is artificial intelligence
 
Basic questions about artificial intelligence
Basic questions about artificial intelligenceBasic questions about artificial intelligence
Basic questions about artificial intelligence
 
Rp 3 published
Rp  3 publishedRp  3 published
Rp 3 published
 
Selected topics in Computer Science
Selected topics in Computer Science Selected topics in Computer Science
Selected topics in Computer Science
 
Introduction
IntroductionIntroduction
Introduction
 
Activity recognition for video surveillance
Activity recognition for video surveillanceActivity recognition for video surveillance
Activity recognition for video surveillance
 
Ai
AiAi
Ai
 
Fundamental Questions - The Second Decade of AI: Towards Architectures for Hu...
Fundamental Questions - The Second Decade of AI: Towards Architectures for Hu...Fundamental Questions - The Second Decade of AI: Towards Architectures for Hu...
Fundamental Questions - The Second Decade of AI: Towards Architectures for Hu...
 
IRJET- ATM Security using Machine Learning
IRJET- ATM Security using Machine LearningIRJET- ATM Security using Machine Learning
IRJET- ATM Security using Machine Learning
 
Practical computer vision-- A problem-driven approach towards learning CV/ML/DL
Practical computer vision-- A problem-driven approach towards learning CV/ML/DLPractical computer vision-- A problem-driven approach towards learning CV/ML/DL
Practical computer vision-- A problem-driven approach towards learning CV/ML/DL
 
The Future of Machine Learning
The Future of Machine LearningThe Future of Machine Learning
The Future of Machine Learning
 
SCONUL Summer Conference 2018 - Nicole coleman
SCONUL Summer Conference 2018 - Nicole colemanSCONUL Summer Conference 2018 - Nicole coleman
SCONUL Summer Conference 2018 - Nicole coleman
 
15mafaz wali--final (1)
15mafaz wali--final (1)15mafaz wali--final (1)
15mafaz wali--final (1)
 
Computer Vision Crash Course
Computer Vision Crash CourseComputer Vision Crash Course
Computer Vision Crash Course
 
Man vs machine consolidated
Man vs machine consolidatedMan vs machine consolidated
Man vs machine consolidated
 
Elements of AI Luxembourg - session 2
Elements of AI Luxembourg - session 2Elements of AI Luxembourg - session 2
Elements of AI Luxembourg - session 2
 
Ai its nature and future
Ai its nature and futureAi its nature and future
Ai its nature and future
 
Suman
SumanSuman
Suman
 
Artificial Intelligence Research Topics for PhD Manuscripts 2021 - Phdassistance
Artificial Intelligence Research Topics for PhD Manuscripts 2021 - PhdassistanceArtificial Intelligence Research Topics for PhD Manuscripts 2021 - Phdassistance
Artificial Intelligence Research Topics for PhD Manuscripts 2021 - Phdassistance
 
Building Usable AR Interfaces
Building Usable AR InterfacesBuilding Usable AR Interfaces
Building Usable AR Interfaces
 

Andere mochten auch

DIY: Computer Vision with GWT.
DIY: Computer Vision with GWT.DIY: Computer Vision with GWT.
DIY: Computer Vision with GWT.JooinK
 
WBCSD Development Focus Area - Overview as of August 2012
WBCSD Development Focus Area - Overview as of August 2012WBCSD Development Focus Area - Overview as of August 2012
WBCSD Development Focus Area - Overview as of August 2012fveglio
 
14 cv mil_the_pinhole_camera
14 cv mil_the_pinhole_camera14 cv mil_the_pinhole_camera
14 cv mil_the_pinhole_camerazukun
 
CVPR2010: higher order models in computer vision: Part 3
CVPR2010: higher order models in computer vision: Part 3CVPR2010: higher order models in computer vision: Part 3
CVPR2010: higher order models in computer vision: Part 3zukun
 
"The OpenCV Open Source Computer Vision Library: What’s New and What’s Coming...
"The OpenCV Open Source Computer Vision Library: What’s New and What’s Coming..."The OpenCV Open Source Computer Vision Library: What’s New and What’s Coming...
"The OpenCV Open Source Computer Vision Library: What’s New and What’s Coming...Edge AI and Vision Alliance
 
"Computer Vision 2.0: Where We Are and Where We're Going," a Presentation fro...
"Computer Vision 2.0: Where We Are and Where We're Going," a Presentation fro..."Computer Vision 2.0: Where We Are and Where We're Going," a Presentation fro...
"Computer Vision 2.0: Where We Are and Where We're Going," a Presentation fro...Edge AI and Vision Alliance
 
General introduction to computer vision
General introduction to computer visionGeneral introduction to computer vision
General introduction to computer visionbutest
 

Andere mochten auch (8)

DIY: Computer Vision with GWT.
DIY: Computer Vision with GWT.DIY: Computer Vision with GWT.
DIY: Computer Vision with GWT.
 
WBCSD Development Focus Area - Overview as of August 2012
WBCSD Development Focus Area - Overview as of August 2012WBCSD Development Focus Area - Overview as of August 2012
WBCSD Development Focus Area - Overview as of August 2012
 
14 cv mil_the_pinhole_camera
14 cv mil_the_pinhole_camera14 cv mil_the_pinhole_camera
14 cv mil_the_pinhole_camera
 
CVPR2010: higher order models in computer vision: Part 3
CVPR2010: higher order models in computer vision: Part 3CVPR2010: higher order models in computer vision: Part 3
CVPR2010: higher order models in computer vision: Part 3
 
"The OpenCV Open Source Computer Vision Library: What’s New and What’s Coming...
"The OpenCV Open Source Computer Vision Library: What’s New and What’s Coming..."The OpenCV Open Source Computer Vision Library: What’s New and What’s Coming...
"The OpenCV Open Source Computer Vision Library: What’s New and What’s Coming...
 
Towards Embedded Computer Vision - New @ 2013
Towards Embedded Computer Vision - New @ 2013Towards Embedded Computer Vision - New @ 2013
Towards Embedded Computer Vision - New @ 2013
 
"Computer Vision 2.0: Where We Are and Where We're Going," a Presentation fro...
"Computer Vision 2.0: Where We Are and Where We're Going," a Presentation fro..."Computer Vision 2.0: Where We Are and Where We're Going," a Presentation fro...
"Computer Vision 2.0: Where We Are and Where We're Going," a Presentation fro...
 
General introduction to computer vision
General introduction to computer visionGeneral introduction to computer vision
General introduction to computer vision
 

Ähnlich wie 01Introduction.pptx - C280, Computer Vision

Ähnlich wie 01Introduction.pptx - C280, Computer Vision (20)

Intro
IntroIntro
Intro
 
Computer vision lightning talk castaway week
Computer vision lightning talk castaway weekComputer vision lightning talk castaway week
Computer vision lightning talk castaway week
 
Everything You Need to Know About Computer Vision
Everything You Need to Know About Computer VisionEverything You Need to Know About Computer Vision
Everything You Need to Know About Computer Vision
 
Introduction
IntroductionIntroduction
Introduction
 
Computer vision
Computer visionComputer vision
Computer vision
 
Computer vision introduction
Computer vision  introduction Computer vision  introduction
Computer vision introduction
 
0-1--Introduction FPCV-0-1.pdf
0-1--Introduction FPCV-0-1.pdf0-1--Introduction FPCV-0-1.pdf
0-1--Introduction FPCV-0-1.pdf
 
Lecture No. 1 introduction.pptx
Lecture No. 1 introduction.pptxLecture No. 1 introduction.pptx
Lecture No. 1 introduction.pptx
 
Materi_01_VK_2223_3.pdf
Materi_01_VK_2223_3.pdfMateri_01_VK_2223_3.pdf
Materi_01_VK_2223_3.pdf
 
Saksham presentation
Saksham presentationSaksham presentation
Saksham presentation
 
RMV Artificial Intelligence
RMV Artificial IntelligenceRMV Artificial Intelligence
RMV Artificial Intelligence
 
computer vision.pdf
computer vision.pdfcomputer vision.pdf
computer vision.pdf
 
Mini Project- 3D Graphics And Visualisation
Mini Project- 3D Graphics And VisualisationMini Project- 3D Graphics And Visualisation
Mini Project- 3D Graphics And Visualisation
 
What is Media in MIT Media Lab, Why 'Camera Culture'
What is Media in MIT Media Lab, Why 'Camera Culture'What is Media in MIT Media Lab, Why 'Camera Culture'
What is Media in MIT Media Lab, Why 'Camera Culture'
 
Saksham seminar report
Saksham seminar reportSaksham seminar report
Saksham seminar report
 
Multimediaexercise
MultimediaexerciseMultimediaexercise
Multimediaexercise
 
vision.ppt
vision.pptvision.ppt
vision.ppt
 
vision_2.ppt
vision_2.pptvision_2.ppt
vision_2.ppt
 
vision.ppt
vision.pptvision.ppt
vision.ppt
 
Computer vision
Computer visionComputer vision
Computer vision
 

Mehr von butest

EL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBEEL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBEbutest
 
1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同butest
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALbutest
 
Timeline: The Life of Michael Jackson
Timeline: The Life of Michael JacksonTimeline: The Life of Michael Jackson
Timeline: The Life of Michael Jacksonbutest
 
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...butest
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALbutest
 
Com 380, Summer II
Com 380, Summer IICom 380, Summer II
Com 380, Summer IIbutest
 
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet JazzThe MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazzbutest
 
MICHAEL JACKSON.doc
MICHAEL JACKSON.docMICHAEL JACKSON.doc
MICHAEL JACKSON.docbutest
 
Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1butest
 
Facebook
Facebook Facebook
Facebook butest
 
Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...butest
 
Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...butest
 
NEWS ANNOUNCEMENT
NEWS ANNOUNCEMENTNEWS ANNOUNCEMENT
NEWS ANNOUNCEMENTbutest
 
C-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.docC-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.docbutest
 
MAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.docMAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.docbutest
 
Mac OS X Guide.doc
Mac OS X Guide.docMac OS X Guide.doc
Mac OS X Guide.docbutest
 
WEB DESIGN!
WEB DESIGN!WEB DESIGN!
WEB DESIGN!butest
 

Mehr von butest (20)

EL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBEEL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBE
 
1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIAL
 
Timeline: The Life of Michael Jackson
Timeline: The Life of Michael JacksonTimeline: The Life of Michael Jackson
Timeline: The Life of Michael Jackson
 
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIAL
 
Com 380, Summer II
Com 380, Summer IICom 380, Summer II
Com 380, Summer II
 
PPT
PPTPPT
PPT
 
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet JazzThe MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
 
MICHAEL JACKSON.doc
MICHAEL JACKSON.docMICHAEL JACKSON.doc
MICHAEL JACKSON.doc
 
Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1
 
Facebook
Facebook Facebook
Facebook
 
Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...
 
Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...
 
NEWS ANNOUNCEMENT
NEWS ANNOUNCEMENTNEWS ANNOUNCEMENT
NEWS ANNOUNCEMENT
 
C-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.docC-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.doc
 
MAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.docMAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.doc
 
Mac OS X Guide.doc
Mac OS X Guide.docMac OS X Guide.doc
Mac OS X Guide.doc
 
hier
hierhier
hier
 
WEB DESIGN!
WEB DESIGN!WEB DESIGN!
WEB DESIGN!
 

01Introduction.pptx - C280, Computer Vision

  • 1. C280, Computer Vision Prof. Trevor Darrell trevor@eecs.berkeley.edu
  • 2. Today Administrivia “What is vision?” Schedule Introductions Image formation
  • 3. Prerequisites This course is appropriate as a first course for graduate students with a EECS background, which should have prepared the students with these essential prerequisites: Data structures A good working knowledge of MATLAB programming (or willingness and time to pick it up quickly!) Linear algebra Vector calculus The course does not assume prior imaging experience, computer vision, image processing, or graphics
  • 4. Text The primary course text will be Rick Szeliski’s draft Computer Vision: Algorithms and Applications; we will use an online copy of the June 19thdraft. The secondary text is Forsyth and Ponce, Computer Vision: A Modern Approach.
  • 7. Matlab Problem sets and projects will involve Matlab programming (you are free to use alternative packages). Matlab runs on all the Instructional Windows and UNIX systems. Instructions and toolkits are described in http://inst.eecs.berkeley.edu/cgi-bin/pub.cgi?file=matlab.help. CS280 students can use their existing EECS Windows accounts in EECS instructional labs, and they can request new accounts (for non-majors) or additional access to Instructional resources by following the instructions about ’named’ accounts in http://inst.eecs.berkeley.edu/connecting.html#accounts. They can logon remotely and run it on some of our servers: http://inst.eecs.berkeley.edu/connecting.html#labs
  • 8. Grading There will be three equal components to the course grade Five problem sets A take-home exam Final project (including evaluation of proposal document, in-class presentation, and final report) In addition, strong class participation can offset negative performance in any one of the above components.
  • 9. Problem sets Pset0 – Basic Image Manipulation in Matlab Pset1 – Filtering and Features Pset2 – Geometry and Calibration Pset3 – Recognition Pset4 – Stereo and Motion Can discuss, but must submit individual work
  • 10. Take-home Limited time: 3 days Covers everything through hand out date Little programming *No discussion or collaboration allowed*
  • 11. Final project Significant novel implementation of technique related to course content Teams of 2 encouraged (document role!) Or journal length review article (no teams) Three components: proposal document (no more than 5 pages) in class results presentation (10 minutes) final write-up (no more than 15 pages)
  • 12. Class Participation Class participation includes showing up being able to articulate key points from last lecture having read assigned sections and being able to “fill in the blank” during the lecture I won’t cold call, but will solicit volunteers Strong in-class participation can offset poor performance in one of the other grade components.
  • 13. Course goals….(broadly speaking) principles of image formation convolution and image pyramids local feature analysis multi-view geometry image warping and stitching structure from motion visual recognition image-based rendering
  • 14. What is computer vision? Done?
  • 15. What is computer vision? Automatic understanding of images and video Computing properties of the 3D world from visual data (measurement) Algorithms and representations to allow a machine to recognize objects, people, scenes, and activities. (perception and interpretation)
  • 16. Vision for measurement Multi-view stereo forcommunity photo collections Real-time stereo Structure from motion NASA Mars Rover Pollefeys et al. Goesele et al. Slide credit: L. Lazebnik
  • 17. Vision for perception, interpretation Objects Activities Scenes Locations Text / writing Faces Gestures Motions Emotions… amusement park sky The Wicked Twister Cedar Point Ferris wheel ride ride 12 E water Lake Erie ride tree tree people waiting in line people sitting on ride umbrellas tree maxair carousel deck bench tree pedestrians
  • 18. Related disciplines Graphics Algorithms Artificial intelligence Image processing Cognitive science Machine learning Computer vision
  • 19. Vision and graphics Model Images Vision Graphics Inverse problems: analysis and synthesis.
  • 20. Why vision? As image sources multiply, so do applications Relieve humans of boring, easy tasks Enhance human abilities: human-computer interaction, visualization Perception for robotics / autonomous agents Organize and give access to visual content
  • 21. Why vision? Personal photo albums Movies, news, sports Medical and scientific images Surveillance and security Images and video are everywhere! Slide credit; L. Lazebnik
  • 22. Again, what is computer vision? Mathematics of geometry of image formation? Statistics of the natural world? Models for neuroscience? Engineering methods for matching images? Science Fiction?
  • 23. Vision Demo? we’re not quite there yet…. Terminator 2
  • 24. Every picture tells a story Goal of computer vision is to write computer programs that can interpret images
  • 25. Can computers match (or beat) human vision? Yes and no (but mostly no!) humans are much better at “hard” things computers can be better at “easy” things
  • 26. Human perception has its shortcomings… Sinha and Poggio, Nature, 1996
  • 28. Current state of the art The next slides show some examples of what current vision systems can do
  • 29. Earth viewers (3D modeling) Image from Microsoft’s Virtual Earth (see also: Google Earth)
  • 30. Photosynth http://labs.live.com/photosynth/ Based on Photo Tourism technologydeveloped by Noah Snavely, Steve Seitz, and Rick Szeliski
  • 31. Photo Tourism overview Photo Explorer Scene reconstruction Input photographs Relative camera positions and orientations Point cloud Sparse correspondence System for interactive browsing and exploring large collections of photos of a scene. Computes viewpoint of each photo as well as a sparse 3d model of the scene.
  • 33.
  • 34. Face detection Many new digital cameras now detect faces Canon, Sony, Fuji, …
  • 35. Smile detection? Sony Cyber-shot® T70 Digital Still Camera
  • 36. Object recognition (in supermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching for items. When an item is detected and recognized, the cashier verifies the quantity of items that were found under the basket, and continues to close the transaction. The item can remain under the basket, and with LaneHawk,you are assured to get paid for it… “
  • 38. Vision-based biometrics “How the Afghan Girl was Identified by Her Iris Patterns” Read the story
  • 39. Login without a password… Face recognition systems now beginning to appear more widelyhttp://www.sensiblevision.com/ Fingerprint scanners on many new laptops, other devices
  • 40. Object recognition (in mobile phones) This is becoming real: Microsoft Research Point & Find, Nokia SnapTell.com (now amazon)
  • 42. Nokia Point and Tell… http://conversations.nokia.com/home/2008/09/point-and-fin-1.html
  • 43. Special effects: shape capture The Matrix movies, ESC Entertainment, XYZRGB, NRC
  • 44. Special effects: motion capture Pirates of the Carribean, Industrial Light and Magic Click here for interactive demo
  • 45. Sports Sportvision first down line Nice explanation on www.howstuffworks.com
  • 46. Smart cars Mobileye Vision systems currently in high-end BMW, GM, Volvo models By 2010: 70% of car manufacturers. Video demo Slide content courtesy of AmnonShashua
  • 47. Smart cars Mobileye Vision systems currently in high-end BMW, GM, Volvo models By 2010: 70% of car manufacturers. Video demo Slide content courtesy of AmnonShashua
  • 48. Vision-based interaction (and games) Digimask: put your face on a 3D avatar. Nintendo Wii has camera-based IRtracking built in. See Lee’s work atCMU on clever tricks on using it tocreate a multi-touch display! “Game turns moviegoers into Human Joysticks”, CNET Camera tracking a crowd, based on this work.
  • 49.
  • 52.
  • 53. Medical imaging Image guided surgery Grimson et al., MIT 3D imaging MRI, CT
  • 54. Current state of the art You just saw examples of current systems. Many of these are less than 5 years old This is a very active research area, and rapidly changing Many new apps in the next 5 years To learn more about vision applications and companies David Lowe maintains an excellent overview of vision companies http://www.cs.ubc.ca/spider/lowe/vision.html
  • 55.
  • 59. Shape from Shading and Texture
  • 62. SFM
  • 63.
  • 64. Let’s get started: Image formation How are objects in the world captured in an image?
  • 65. Physical parameters of image formation Geometric Type of projection Camera pose Optical Sensor’s lens type focal length, field of view, aperture Photometric Type, direction, intensity of light reaching sensor Surfaces’ reflectance properties
  • 66.
  • 67. Idea 1: put a piece of film in front of an object
  • 68. Do we get a reasonable image?Slide by Steve Seitz
  • 69.
  • 71. The opening is known as the aperture
  • 72. How does this transform the image?Slide by Steve Seitz
  • 73. Pinhole camera Pinhole camera is a simple model to approximate imaging process, perspective projection. Image plane Virtual image pinhole If we treat pinhole as a point, only one ray from any given point can enter the camera. Fig from Forsyth and Ponce
  • 74. Camera obscura In Latin, means ‘dark room’ "Reinerus Gemma-Frisius, observed an eclipse of the sun at Louvain on January 24, 1544, and later he used this illustration of the event in his book De Radio Astronomica et Geometrica, 1545. It is thought to be the first published illustration of a camera obscura..." Hammond, John H., The Camera Obscura, A Chronicle http://www.acmi.net.au/AIC/CAMERA_OBSCURA.html
  • 75. Camera obscura Jetty at Margate England, 1898. An attraction in the late 19th century Around 1870s http://brightbytes.com/cosite/collection2.html Adapted from R. Duraiswami
  • 76. Camera obscura at home http://blog.makezine.com/archive/2006/02/how_to_room_sized_camera_obscu.html Sketch from http://www.funsci.com/fun3_en/sky/sky.htm
  • 78. Perspective effects Far away objects appear smaller Forsyth and Ponce
  • 80. Perspective effects Parallel lines in the scene intersect in the image Converge in image on horizon line Image plane (virtual) pinhole Scene
  • 81. Slide Credits Slides 14-21, 55-66: Kristen Grauman Slides 23-40,43-52: Steve Seitz and others, as marked…
  • 82. Next time Continue with Image Formation Readings for today: Szeliski, Ch. 1 Readings for next lecture: Szeliski 2.1-2.3.1, Forsyth and Ponce 1.1, 1.4 (optional). Pset 0 released tomorrow, due following Friday