the opportunities and challenges of putting the latest computer vision and deep learning algorithms...
TRANSCRIPT
![Page 1: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/1.jpg)
Putting the latest Computer Vision and Deep Learning algorithms to work
The Opportunities and Challenges
Albert Y. C. Chen, Ph.D.Vice President, R&D
Viscovery
![Page 2: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/2.jpg)
Albert Y. C. Chen, Ph.D. • Experience
2017-present: Vice President of R&D at Viscovery 2016-2017: Chief Scientist at Viscovery 2015: Principal Scientist @ Nervve Technologies 2013-2014 Computer Vision Scientist @ Tandent Vision 2011-2012 @ GE Global Research
• Education Ph.D. in Computer Science, SUNY-Buffalo M.S. in Computer Science, NTNU B.S. in Computer Science, NTHU
• Some random things about me… SUNY Excellence in Teaching Award, 2010. Some rapid promotions, some failed startups, some patents, some papers…
![Page 3: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/3.jpg)
1. W. Wu, A. Y. C. Chen, L. Zhao, and J. J. Corso. Brain tumor detection and segmentation in a CRF framework with pixel-wise affinity and superpixel-level features. International Journal of Computer Assisted Radiology and Surgery, 2015.
2. S. N. Lim, A. Y. C. Chen and X. Yang. Parameter Inference Engine (PIE) on the Pareto Front. In Proceedings of International Conference of Machine Learning, Auto ML Workshop, 2014.
3. A. Y. C. Chen, S. Whitt, C. Xu, and J. J. Corso. Hierarchical supervoxel fusion for robust pixel label propagation in videos. In Submission to ACM Multimedia, 2013.
4. A.Y.C. Chen and J.J. Corso. Temporally consistent multi-class video-object segmentation with the video graph-shifts algorithm. In Proceedings of IEEE Workshop on Applications of Computer Vision, 2011.
5. D.R. Schlegel, A.Y.C. Chen, C. Xiong, J.A. Delmerico, and J.J. Corso. Airtouch: Interacting with computer systems at a distance. In Proceedings of IEEE Workshop on Applications of Computer Vision, 2011.
6. A.Y.C. Chen and J.J. Corso. On the effects of normalization in adaptive MRF Hierarchies. In Proceedings of International Symposium CompIMAGE, 2010.
7. A.Y.C. Chen and J.J. Corso. Propagating multi-class pixel labels throughout video frames. In Proceedings of IEEE Western New York Image Processing Workshop, 2010.
8. A. Y. C. Chen and J. J. Corso. On the effects of normalization in adaptive MRF Hierarchies. Computational Modeling of Objects Represented in Images, pages 275–286, 2010.
9. Y. Tao, L. Lu, M. Dewan, A. Y. C. Chen, J. J. Corso, J. Xuan, M. Salganicoff, and A. Krishnan. Multi-level ground glass nodule detection and segmentation in ct lung images. Medical Image Computing and Computer-Assisted Intervention, 2009.
10. A.Y.C. Chen, J.J. Corso, and L. Wang. Hops: Efficient region labeling using higher order proxy neighborhoods. In Proceedings of IEEE International Conference on Pattern Recognition, 2008.
![Page 4: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/4.jpg)
Some work done before I caught the startup fever
Freestyle Sketching Stage
AirTouch waits in background for the initialization signal
Initialize
Terminate
Output
imagedatabase
Start:Results
CBIRquery
Airtouch HCI interface for Content-based Image Retrieval
![Page 5: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/5.jpg)
Interactive Segmentation & Classification• Segmentation then classification:
• computationally more efficient, • results in much higher classification accuracy.
• Pioneered the “pixel label propagation” field. • First to utilize superpixels and supervoxels for the task.
FG
Traditional Spatial Propagation
Pixel label map
Label a subset of pixels
BG
Spatio-temporal Propagation
time
![Page 6: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/6.jpg)
Image/Video Object Recognition and Content Understanding
approaches
person carries
gives
recieves
Ontology
object
Person 1 Person 1Person 2
High-Level
Mid-Level
approachactivity
receives givescarries
activityactivity activity
Time
Reasoning
xx
x
Low-Level
x x
x
x
![Page 7: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/7.jpg)
Learning and Adapting Optimal Classifier Parameters
subspace B
subsp
ace A
subspace C
Image-level feature space
priors
Patch-level feature space
posteriorprobability
suggest optimal parameter configuration
![Page 8: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/8.jpg)
Graphical Models and Stochastic Optimization
A
(a) The space-time volume of a video showing the objects (A--F) and their appearing time-span.
spac
e
time
AB
C
D
E F
B E
F
C
D
(b) The temporal relationship graph. An edge between two vertices mean that the two objects overlap in time.
(c) The goal is: cover all objects with the smallest number of "ground truth key frames".
spac
e
time
AB
C
D
E F
key 1 key 2
A
B E
F
C
D
(d) This translates to: iteratively solving the max clique problem until all vertices belong to a clique.
A
B E
F
C
Dkey 2
key 1frame t-1 frame t
layer n layer n
layer n+1 layer n+1
TemporalShift
Shift
µ
![Page 9: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/9.jpg)
Medical Imaging and Geospatial Imaging
GNN detection and segmentation
in Lung CT geospatial imaging: building detection
Brain tumor detection and segmentation in MR images.
![Page 10: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/10.jpg)
Why Risk to Innovate?
• Good business model NEVER last forever.
• Average “shelf life” on S&P 500: 20 years.
• 100-year old companies constantly reinvent themselves every 10-20 years
• Startups contribute to 20% of USA’s GDP.
![Page 11: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/11.jpg)
The Death of a Good Business Model
• Foxconn 20 year revenue v.s. net profit (now at 5%)
![Page 12: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/12.jpg)
What do 100 year old corporations do?
GE Schenectady, 1896
![Page 13: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/13.jpg)
History of change at GE• 1886: one of the 12 original companies on the Dow
Jone Industrial Average (also the only one remaining). • 1889: lightbulbs • 1919: radios • 1927: TV • 1941: jet engine • 1960: nuclear power • 1971: room AC units • 1995: MRI
![Page 14: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/14.jpg)
History of change at IBM• 1960s: mainframe computer • 1980s: personal computer • 2000s: integrated solutions • 2020s: AI, Watson
![Page 15: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/15.jpg)
How about the leading Semiconductor companies?
![Page 16: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/16.jpg)
NVidia reinventing itself —2 times in 20 years
![Page 17: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/17.jpg)
“Bad money drives out good” in the desktop GPU market
![Page 18: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/18.jpg)
The rise of mobile computing, and how NVidia missed the boat!
![Page 19: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/19.jpg)
NVidia’s Tegra mobile processors never took off
then, the market saturated…
![Page 20: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/20.jpg)
NVidia not just survived. NVidia is thriving!
![Page 21: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/21.jpg)
Meet the new NVidia: Deep Learning, Deep Learning, and still, Deep Learning
![Page 22: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/22.jpg)
![Page 23: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/23.jpg)
The king is dead, long live the king!
![Page 24: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/24.jpg)
Now, again, do we want to do OEM/ODM forever?
Optimizing an old business model is just delaying its eventual death.
![Page 25: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/25.jpg)
Startups• A company, partnership, or temporary
organization designed to search for a new, repeatable and scalable business model.
![Page 26: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/26.jpg)
Your Idea• Are you passionate about it? • Is it disruptive enough? • What is your business plan?
• What is it? • Can it make money? • What is the future of the idea?
• What is your competitive advantage? • How do you build up your entry barrier?
![Page 27: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/27.jpg)
A minimal startup team
• A hacker
• A hustler
• A hipster
![Page 28: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/28.jpg)
![Page 29: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/29.jpg)
Startup Timeline
![Page 30: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/30.jpg)
Prototype• Hack out a prototype
• Spend 2-10 weeks max.
• Investors are much more likely to fund you if you have a minimal initial version of your idea.
• Hackathons are a good place to start.
• Iteratively improve the prototype
![Page 31: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/31.jpg)
Money!
![Page 32: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/32.jpg)
![Page 33: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/33.jpg)
Buildup your entry barrier!
• Market (users)
• Speed
• Team
• Technology
![Page 34: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/34.jpg)
Building entry barrier with Technology!!
![Page 35: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/35.jpg)
Computer Vision, it can’t be that hard, right?
![Page 36: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/36.jpg)
Brief History
Marvin Minsky
“In 1966, Minsky hired a first-year undergraduate student and assigned him a problem to solve over the summer: connect a television camera to a computer and get the machine to describe what it sees.”
Gerald SussmanThe student never worked on Computer Vision problems again.
![Page 37: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/37.jpg)
Brief History• 1960’s: interpretation of synthetic worlds • 1970’s: some progress on interpreting selected images • 1980’s: ANNs come and go; shift toward geometry and increased
mathematical rigor • 1990’s: face recognition; statistical analysis in vogue • 2000’s: broader recognition; large annotated datasets available; video
processing starts
Guzman ‘68 Ohta Kanade ‘78 Turk and Pentland ‘91
![Page 38: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/38.jpg)
What’s in our arsenal?
• Image filters
• Feature descriptors
• Classifiers
![Page 39: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/39.jpg)
Filters: blurring
![Page 40: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/40.jpg)
Filters: sharpen
![Page 41: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/41.jpg)
Filters: edge
![Page 42: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/42.jpg)
Filters: straight lines
![Page 43: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/43.jpg)
Features:
![Page 44: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/44.jpg)
Features: Harris Corners
![Page 45: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/45.jpg)
Features: Laplacian of Gaussian (LoG; scale detection)
![Page 46: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/46.jpg)
Features: OrientationHow to compute the rotation?
Create edge orientation histogram and find peak.
![Page 47: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/47.jpg)
Features: SIFT
![Page 48: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/48.jpg)
Features: SIFT
![Page 49: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/49.jpg)
Features: Gabor
![Page 50: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/50.jpg)
Classifiers: SVM
![Page 51: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/51.jpg)
Classifiers: Ensemble
![Page 52: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/52.jpg)
Classifiers: Random Fields
![Page 53: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/53.jpg)
Classifiers: Deformable Parts Model (DPM)
![Page 54: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/54.jpg)
Classifiers: Deep Neural Network
![Page 55: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/55.jpg)
What alg. should I use then?• How much data do we have? • What objects are we trying to detect? • For example, Google’s DNN trained with 11k images
over 20 classes in 2013 doesn’t always beat DPM.
00.150.3
0.450.6
aero bike bird boat bottle bus car cat chair cow
00.150.3
0.450.6
dog horse m-bike person plant sheep sofa table train TV
D N N
D P M
![Page 56: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/56.jpg)
ML alg. and their Applications• Deep
Learning
• Markovian/Bayesian
• Feature Matching
• Other ML methods
![Page 57: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/57.jpg)
Meta-Learning• Different use
cases calls for different ML algorithms.
• Meta-Learning: learning how to learn.
• Requires plenty of domain-specific know-how.
![Page 58: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/58.jpg)
Maturing Computer Vision Applications
![Page 59: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/59.jpg)
• Final inspection cells • Robot guidance and
checking orientation of components
• Packaging Inspection • Medical vial inspection • Food pack checks • Verifying engineered
components[5] • Wafer Dicing • Reading of Serial
Numbers • Inspection of Saw
Blades
• Inspection of Ball Grid Arrays (BGAs)
• Surface Inspection • Measuring of Spark
Plugs • Molding Flash Detection • Inspection of Punched
Sheets • 3D Plane
Reconstruction with Stereo
• Pose Verification of Resistors
• Classification of Non-Woven Fabrics
1970s-now: Machine Vision for Industrial Inspection
• Automated Train Examiner (ATEx) Systems
• Automatic PCB inspection
• Wood quality inspection
• Final inspection of sub-assemblies
• Engine part inspection • \Label inspection on
products • Checking medical
devices for defects
![Page 60: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/60.jpg)
Industrial Inspection: turbofan jet engine blade maintenance• Some seemingly daunting
machine vision tasks actually works with relatively simple image processing algorithms.
![Page 61: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/61.jpg)
Industrial Inspection: Cognex Omniview
![Page 62: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/62.jpg)
Industrial Inspection: Cognex Omniview
![Page 63: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/63.jpg)
License Plate Recognition (1979-now)
![Page 64: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/64.jpg)
License Plate Readers with Text Detection and Neural Networks
![Page 65: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/65.jpg)
Biometrics
![Page 66: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/66.jpg)
Automated Fingerprint Identification (1970s-now)
![Page 67: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/67.jpg)
Face Recognition (1990s-now)
• Face Detection (Viola and Jones, 2001)
• Face Verification (1:1) v.s. Identification (1:N)
![Page 68: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/68.jpg)
Face Verification and Identification, Labeled Faces in the Wild (LFW)
Recognition Accuracy: • 1 to 1: 99%+ • 1 to 100: 90% • 1 to 10,000:
50%-70%. • 1 to 1M: 30%.
LFW dataset, common FN↑, FP↓
![Page 69: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/69.jpg)
![Page 70: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/70.jpg)
Sports—NFL first down line (1995-now)
![Page 71: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/71.jpg)
Sports—NFL first down line
minus
equals
![Page 72: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/72.jpg)
3D Reconstruction(As old as CV; became practical since SIFT)
![Page 73: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/73.jpg)
3D Reconstruction with Feature Matching, Structure from Motion
![Page 74: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/74.jpg)
3D Reconstruction with Feature Matching, Structure from Motion
![Page 75: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/75.jpg)
Image Panoramas (1980s - now)
![Page 76: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/76.jpg)
Solving Panorama Problem with Markov Random Fields
Input:
![Page 77: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/77.jpg)
Solving Panorama Problem with Markov Random Fields
Input:
![Page 78: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/78.jpg)
Solving Panorama Problem with Markov Random Fields
Input:
![Page 79: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/79.jpg)
Solving Panorama Problem with Markov Random Fields
Input:
![Page 80: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/80.jpg)
Solving Panorama Problem with Markov Random Fields
Input:
![Page 81: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/81.jpg)
Solving Panorama Problem with Markov Random Fields
Input:
![Page 82: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/82.jpg)
Solving Panorama Problem with Markov Random Fields
Input:
![Page 83: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/83.jpg)
Solving Panorama Problem with Markov Random Fields
![Page 84: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/84.jpg)
Solving Panorama Problem with Markov Random Fields
ICM (Iterated Conditional Modes), 1986
![Page 85: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/85.jpg)
Solving Panorama Problem with Markov Random Fields
Belief Propagation (1980-2000)
![Page 86: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/86.jpg)
Solving Panorama Problem with Markov Random Fields
Graph-Cuts (alpha expansion), 2001
![Page 87: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/87.jpg)
Photosynthesis
![Page 88: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/88.jpg)
Solving Photosynthesis Problems with Alpha-matting (2000s-now)
![Page 89: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/89.jpg)
Object Detection & Classification state-of-the-art
• ImageNet Large Scale Visual Recognition Challenge (ILSVRC) • 1000+ classes, 1.2M images.
0
0.125
0.25
0.375
0.5
11 12 13 14 11 12 13 14classification
errorclassification
+localization error
![Page 90: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/90.jpg)
Image Scene Classification• MIT Places 401
dataset.
• top-5 accuracy rates >80%.
![Page 91: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/91.jpg)
Self-driving cars (2000s-now)
![Page 92: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/92.jpg)
DARPA Grand Challenge (2005)
![Page 93: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/93.jpg)
2005 winner, Stanley (Stanford), 3mph through desert
![Page 94: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/94.jpg)
![Page 95: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/95.jpg)
DARPA Urban Challenge (2007)
![Page 96: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/96.jpg)
2007 winner, Boss (CMU), 13mpg through the city
![Page 97: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/97.jpg)
Self Driving Cadillac, US congressman to airport, 2013
![Page 98: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/98.jpg)
Google Self Driving Car, 2015
![Page 99: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/99.jpg)
Google Self-Driving Car, 2016
![Page 100: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/100.jpg)
Google Self-Driving Car, 2016
![Page 101: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/101.jpg)
NVidia Self Driving Car, 2016
![Page 102: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/102.jpg)
How did we come this far? Race car drivers know the trick
![Page 103: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/103.jpg)
Focus on Free Space / Drivable Area, not Obstacles!
![Page 104: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/104.jpg)
Up-and-coming Computer Vision
Applications
![Page 105: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/105.jpg)
Structure from X, Floored
![Page 106: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/106.jpg)
Structure from X, PIX4D
![Page 107: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/107.jpg)
Object Recognition Blue River Technology
![Page 108: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/108.jpg)
Augmented Reality Magic Leap
![Page 109: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/109.jpg)
IMRSV
![Page 110: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/110.jpg)
Retail Insights
Source: Prism Skylabs
![Page 111: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/111.jpg)
Other Applications in Business Intelligence
• Measure brand exposure. • Measure sponsorship effectiveness. • Loss prevention and retail layout optimization.
![Page 112: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/112.jpg)
How about Smart Surveillance?
![Page 113: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/113.jpg)
Angel.co
![Page 114: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/114.jpg)
My humble attempts at putting the latest Computer Vision algorithms to work
![Page 115: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/115.jpg)
Intrinsic Imaging at Tandent Vision Science
Computer Vision would be half-solved without shadows!
LightOriginal Image Surface
![Page 116: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/116.jpg)
Tandent Lightbrush
Video Tutorial for Tandent Lightbrush: https://vimeo.com/47009123
![Page 117: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/117.jpg)
![Page 118: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/118.jpg)
Issues• Highly anticipated, highly acclaimed, but small
crowd at $500 a license.
• Adobe Photoshop monopoly and the “not invented here” syndrome.
• Adobe’s arch-rival, Corel (Corel Draw, Paint Shop Pro, Ulead PhotoImpact) was DYING and asked too much from the botched deal.
![Page 119: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/119.jpg)
Have fun scribbling out your shadows in photoshop!
Poor Bob from Adobe wasted 9 minutes removing just 1 shadow
![Page 120: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/120.jpg)
Intrinsic Imaging for improving the RGB signal in autonomous driving
![Page 121: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/121.jpg)
Intrinsic Imaging’s other applications
![Page 122: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/122.jpg)
Retrospect
• 20 researchers burned 25 million in 8 years; investors got 50 patents in return, period.
• Overestimated the total addressable market size, in a market with existing monopoly.
• Many missed opportunities. Counterexample of the lean startup model.
![Page 123: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/123.jpg)
Some SfM, SLAM startups
![Page 124: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/124.jpg)
Satellite/Aerial Imagery Analysis
• 40cm resolution at 30fps for 90 sec for any location on earth. • One LEO satellite revisits any place on Earth every 3 days. • Need 24 satellites to revisit any place on Earth every 3 hours.
![Page 125: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/125.jpg)
Challenges for Single satellite depth estimation and 3D reconstruction
• At 30fps, a LEO satellite travels 250m between two consecutive frames —> theoretically sufficient for cm-level depth estimation.
• Sources of Noise: • Camera distortions • Atmospheric Disturbance • Ground vegetation • Sub-pixel sampling noise
1 2
![Page 126: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/126.jpg)
What happened?
• B2B customers takes too long to strike deals.
• Google ate us alive in just 3 months, while we were still pitching for VC-funding with our prototype.
![Page 127: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/127.jpg)
Visual Search at Nervve
![Page 128: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/128.jpg)
Retrospect• Growth pains expanding from intelligence
community clients to advertisement clients. • Forming the right team of engineers and
researchers and moving at the right pace. • For any Computer Vision/Machine Learning
company: • Researchers that cannot program—> OUT • Engineers that don’t know math —> OUT
![Page 129: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/129.jpg)
Visual Search, Simply Smarter
![Page 130: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/130.jpg)
Once in a lifetime opportunity in China’s video streaming market
![Page 131: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/131.jpg)
What do we need?
Face MotionImage scene Text Audio Object
Semantics
![Page 132: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/132.jpg)
Viscovery VDS (Video Discovery Service)
![Page 133: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/133.jpg)
Viscovery VDS (Video Discovery Service)
![Page 134: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/134.jpg)
Viscovery VDS (Video Discovery Service)
![Page 135: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/135.jpg)
Challenges Encountered Along the Way
• From Product Recognition in Images, to Face, Logo, Object, Scene recognition in Videos. • Number of Categories • Recognition Accuracy • Recognition Speed
• System Architecture
• Business Model
![Page 136: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/136.jpg)
Viscovery’s Edge• Market: first mover’s advantage in China’s video
streaming market. • Speed: we built the whole VDS thing in a few months! • Team: You! Seriously! • Technology:
• Depth • Breadth • Cloud • Customizability • Self-Learning
![Page 137: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/137.jpg)
Life is not all rosy at startups
• High Risk, High Pressure, High Uncertainty!
• Resources are scarce, but you MUST DELIVER!
• Forming your all-star team is not that easy…
• Focus, and persistence.
![Page 138: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/138.jpg)
What can Taiwan’s academia do to help bridge the gap?
HMM….
![Page 139: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/139.jpg)
Academia
IndustryGeneral Public
reputation and policy support
improved living standards
students
opportunity
well-trained graduates
grants and collaborations
A healthy cycle
![Page 140: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/140.jpg)
Academia
IndustryGeneral Public
unsupportive policies
stagnant wages
useless education
unemployable graduates
A vicious cycle
no grants
no students
![Page 141: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/141.jpg)
Where should we start? Maybe with a few more stories.
![Page 142: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/142.jpg)
Where should we start? Maybe with a few more stories.
![Page 143: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/143.jpg)
Where should we start? Maybe with a few more stories.
![Page 144: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/144.jpg)
The Goldilocks zone of innovation
![Page 145: The Opportunities and Challenges of Putting the Latest Computer Vision and Deep Learning Algorithms to Work](https://reader036.vdocuments.mx/reader036/viewer/2022062503/5a64eee97f8b9aa2548b4ad5/html5/thumbnails/145.jpg)
The Goldilocks zone of innovation
Business Relevance
Academic Relevance
plentiful resources; hierarchical organization
lack of resources; responsive organization
traditional corporations talking “innovation”
corporate research
startups struggling to survive
academic spinoffs
MSR