Google Developers Italia: Potenzia i tuoi modelli di visione artificiale con l'API Object Detection di TensorFlow

Local blog for Italian speaking developers

Potenzia i tuoi modelli di visione artificiale con l'API Object Detection di TensorFlow

28 giugno 2017

code { background-color: transparent }Pubblicato da Jonathan Huang, Research Scientist e Vivek Rathod, Software Engineerstimolare i progressi compiuti nell'ambito della comunità di ricerca

Oggetti rilevati in un'immagine campione (dal set di dati COCO) creati da uno dei nostri modelli. Crediti per l'immagine: Michael Miley, immagine originale.

competizione di rilevamento COCO1,2,3,4,5,6,7NestCamOggetti simili e idee di stilerilevamento di nome e numero civicoAPI Object Detection di TensorFlowTensorFlow

Una selezione di modelli di rilevamento addestrabili, tra cui:

Single Shot Multibox Detector (SSD) con MobileNets
SSD con Inception V2
Region-Based Fully Convolutional Networks (R-FCN) con Resnet 101
Faster RCNN con Resnet 101
Faster RCNN con Inception Resnet v2

Carichi calibrati e fissati (addestrati in base al set di dati COCO) per ciascuno dei modelli precedenti tali da poter essere usati per soluzioni di inferenza predefinita

Jupyter Notebook per eseguire l'inferenza predefinita con uno dei modelli rilasciati

Convenienti script di formazione locali nonché pipeline di formazione e di valutazione distribuite tramite Google Cloud

paper CVPR 2017Sei pronto per iniziare?quiJupyter Notebookaddestrando il tuo rilevatore di animali domestici nel motore Cloud ML
RiconoscimentiContributori principali:Derek Chow, Chen Sun, Menglong Zhu, Matthew Tang, Anoop Korattikara, Alireza Fathi, Ian Fischer, Zbigniew Wojna, Yang Song, Sergio Guadarrama, Jasper Uijlings, Viacheslav Kovalevskyi, Kevin MurphyUn ringraziamento particolare a:Andrew Howard, Rahul Sukthankar, Vittorio Ferrari, Tom Duerig, Chuck Rosenberg, Hartwig Adam, Jing Jing Long, Victor Gomes, George Papandreou, Tyler ZhuRiferimenti

Speed/accuracy trade-offs for modern convolutional object detectors, Huang et al., CVPR 2017 (paper che descriive questo framework)

Towards Accurate Multi-person Pose Estimation in the Wild, Papandreou et al., CVPR 2017

YouTube-BoundingBoxes: A Large High-Precision Human-Annotated Data Set for Object Detection in Video, Real et al., CVPR 2017 (vedi anche il nostro post del blog )

Beyond Skip Connections: Top-Down Modulation for Object Detection, Shrivastava et al., arXiv preprint arXiv:1612.06851, 2016

Spatially Adaptive Computation Time for Residual Networks, Figurnov et al., CVPR 2017

AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions, Gu et al., arXiv preprint arXiv:1705.08421, 2017

MobileNets: Efficient convolutional neural networks for mobile vision applications, Howard et al., arXiv preprint arXiv:1704.04861, 2017

Etichette

Android Firebase machine learning Google Cloud Platform GDL Eventi Google Developers Live Google Play TensorFlow App Chrome Cloud api GDLItalia GDE GDG Google Assistant iOS Kotlin Actions on Google Deep Learning AppEngine AMP BigQuery Cloud Functions Flutter Android Studio Google Developers Expert Università Google AppEngine JavaScript AI Android Wear GAE Google Play Store HTML5 Maps security Android App Development AngularJS IoT Kubernetes Annunci Cloud Firestore Cloud Machine Learning Google I/O Polymer Android Things Community DevTools Google App Engine intelligenza artificiale Entrepreneurship Firebase Analytics GSoC Games Google Cast ML open source Crashlytics Dart Diversity Drive Google Data Studio Google Play Games TensorFlow Lite Android Developers Android O Cloud Spanner Cloud TPU Compute Engine DevFest Google Compute Engine Google Developers Material Design Mobile PWA Python Startup AIY Project ARCore Android Jetpack AndroidDev Androidq Apps Script Artificial Intelligence Augmented Reality Firebase Cloud Messaging Google Cloud Google Maps Gsuite IO19 ML kit Research VR coding unity #io19 AR Android Dev Summit Android Developer Android Q Cardboard Cloud AI Coral Developers Dialogflow Firebase Realtime Database Gmail Google AI Google Cloud Messaging Google ContainerEngine Google Play Console Kotlin Coroutines NLP Programming Responsive Design TensorFlowjs Testing WTM Women beacons cloud storage developer node JS student programs women techmakers API Cloud Vision Add-ons Android P AndroidDevStory Animation AutoML Brillo Classroom DSC Database Developer Student Clubs Edge TPU Fabric Featured Flutter Web G Suite GWT GoLang Google Google Brain Google Cloud Next Google Container Engine Google Developer Groups Google I/O Extended Graph Hosting Instant Apps Keras Livedata Mobile Sites Prediction Privacy Project Tango SDK Stackdriver Tales UI Udacity Virtual Reality Web Web Development YouTube analytics android security api.ai courses google io indies natural language processing reti neurali sign-in young developers 2d Animation 3d AIY ARkit Adversarial Learning Alpha Android App Android App Developmen Android App bundle Android Architecture Android Architecture Components Android Auto Android Automotive OS Android Dev Summit Android Developer Android Developer Challenge Android Developers GooglePlayAwards Android Development Android Go Android Instant App Android Pie Android Q Scoped Storage Android Q audio Android Styles Android audio playback capture Android codelabs AndroidTV AndroidX Angular Aogdevs Api Design App Development App Distribution Apps Architecture Architecture Components Arduino Best Practices Betatesting Bugs C++ Certification Cloud Anchors Cloud Next Cloud Run Cloud Service Platform Cloud Shell Cloud Study Jam Coached Conversational Preference Elicitation Commerce Community Connector Computer Science Consistency Containers Converge Conversation Design Crash Reporting DLS Design Dagger Data Science Databases Dependency Injection Design Developer Communities Developer Community Developer Culture Developer Story Developing Media Apps Development Eager Edge TPU Dev Board Education Emulatore Android Error Message Eslint Europe Firebase Extensions Firebase Summit 2019 Firebasehosting Flutter 1.5 Flutter at IO FlutterDark GCE GDD Game Development Gboard Gesture Navigation Glass Go Google AI Quantum Google App Script Google Cloud Functions Google Cloud billing Google Coral Google Developer Days Google Home Hub Google IOS Android Google Identity Platform Google Launchpad Google Lens Google Now Google Photos Google Play Devs Google Play Indie Games Festival Google Play Instant Google Plus Google codelabs Google+ GoogleDevWeekly GoogleLaunchpad GooglePlay Graphics Healthcare I/O IO IO19 Flutter In-app Billing Indie Games Indie Games Festival Indie games showcase Indie showcase Ingress Instant Games Issues Java Jetpack Knative Kotlin Beginners Kotlin Everywhere Kotlin codelabs Lighthouse Live Caption Live Streaming Localization Location M-Theory Mondaygram Monetization NYT NativeScript Navigation Neural Graph Learning Neural Structured Nodejs OS OS Updates Olivex One Time Codes Online Education PHA Performance Monitoring Policy Posenet Project Mainline Project Treble Quantum Computing Theory Reactive Programming Regression Remote Config Resonance Audio Room Scoped Storage Semantics Semi Supervised Learning Serverless Sms Retriever Api Sms Verification Speech Recognition Swift Tensorflow Core Tensorflow Hub Test Lab Text Tokenizer Tpu Transformers UX UX Design UX Research Universal Sentence Encoder Unsupervised Data Augmentation Unsupervised Learning User Experience Viewmodel Voice WWW Wear OS WebAssembly Widget Women in Tech WomenTechmakers android kotlin app stability assistant audio recording augmented faces authsub best practices and updates billing botnet business c++ games cancer chatbot chrome privacy codelab codelabs competition daydream designer dominio .dev error handling event firebase games firebase gdc firebase hosting firebase unity game center authentication game testing games authentication gdc google summer of code googledevelopers grow hashcode indie indie developers internship kids machine intelligence machine learning accelerator maker multi-platform nearby oauth openid performance persistent AR privacy sandbox prizes prototype purchase flows queries realtime responsible AI security rules showcase solutions challenge startup africa roadtrip startup times students summer of code unity crashlytics verify apps win

Archivio Blog

2020
- feb
- gen

2019
- dic
- nov
- ott
- set
- ago
- lug
- giu
- mag
- apr
- mar
- feb
- gen

2018
- dic
- nov
- ott
- set
- ago
- lug
- giu
- mag
- apr
- mar
- feb
- gen

2017
- dic
- nov
- ott
- set
- ago
- lug
- giu
- mag
- apr
- mar
- feb
- gen

2016
- dic
- nov
- ott
- set
- ago
- lug
- giu
- mag
- apr
- mar
- feb
- gen

2015
- dic
- nov
- ott
- set
- ago
- lug
- giu
- mag
- apr
- mar
- feb
- gen

2014
- dic
- nov
- ott
- set
- ago
- lug
- giu
- mag
- apr
- mar
- feb
- gen

2013
- dic
- nov
- ott
- set
- ago
- lug
- giu
- mag
- apr
- mar
- feb
- gen

Feed

Follow @GoogleDevsItaly

Google
Privacy
Terms