
ML Kit in Action (Android)

Mobile Things S02E03


When machine learning meets augmented reality

Qian JIN | @bonbonking | [email protected]

Image Credit: https://becominghuman.ai/part-1-migrate-deep-learning-training-onto-mobile-devices-c28029ffeb30


ML Kit in Action
• The building blocks of ML Kit
• Vision APIs: text recognition, face detection, barcode scanning, image labeling, landmark recognition
• Custom models
• Custom TensorFlow Lite build
• General feedback
The building blocks of ML Kit

Mobile Vision API + Google Cloud Vision API = ML Kit Vision APIs

TensorFlow Lite / custom TF Lite build + Neural Network API = ML Kit Custom Models
Vision APIs
Vision:
You talking to me?
FirebaseVisionImage
• fromBitmap
• fromByteArray
• fromByteBuffer
• fromFilePath
• fromMediaImage
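
A minimal sketch (Kotlin, not from the original deck) of building a FirebaseVisionImage; bitmap, mediaImage and rotation are placeholders for whatever your camera pipeline provides:

val imageFromBitmap = FirebaseVisionImage.fromBitmap(bitmap)

// For camera frames, pass the media.Image plus its rotation so the detector sees the picture upright
// (rotation is one of FirebaseVisionImageMetadata.ROTATION_0/90/180/270)
val imageFromFrame = FirebaseVisionImage.fromMediaImage(mediaImage, rotation)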
Text Recognition

On-device vs Cloud
• Pricing: on-device free; cloud free for the first 1000 uses of this feature per month
• Ideal use cases: on-device real-time processing; cloud high-accuracy text recognition and document scanning
• Language support: on-device Latin characters; cloud a broad range of languages and special characters
INPUT FirebaseVisionImage

FirebaseVisionTextDetector

OUTPUT FirebaseVisionText
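
A sketch (Kotlin) of running the on-device detector, assuming a FirebaseVisionImage named image and a log TAG; the success listener receives the FirebaseVisionText that the loop below walks through:

val detector = FirebaseVision.getInstance().visionTextDetector
detector.detectInImage(image)
    .addOnSuccessListener { firebaseVisionText ->
        // iterate blocks / lines / elements as in the snippet below
    }
    .addOnFailureListener { e -> Log.e(TAG, "Text recognition failed", e) }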

for (FirebaseVisionText.Block block : firebaseVisionText.getBlocks()) {
    Rect boundingBox = block.getBoundingBox();
    Point[] cornerPoints = block.getCornerPoints();
    String text = block.getText();

    for (FirebaseVisionText.Line line : block.getLines()) {
        // ...
        for (FirebaseVisionText.Element element : line.getElements()) {
            // ...
        }
    }
}
Face Detection: Key Capabilities

• Recognise and locate facial features


• Recognise facial expressions
• Track faces across video frames
• Process video frames in real time
• Face orientation
• Face tracking
• Landmarks
• Classification
Face Orientation

• Euler X
• Euler Y
• Euler Z
Landmarks

• A landmark is a point of interest within a face. The left eye, right eye, and nose base are all examples of landmarks.
Classification

• Two classifications are supported: eye open (left & right eye) and smiling
• Inspiration: the Android Things photo booth
Face Detection Options

FirebaseVisionFaceDetectorOptions options =
        new FirebaseVisionFaceDetectorOptions.Builder()
                .setModeType(FirebaseVisionFaceDetectorOptions.ACCURATE_MODE)                 // favour accuracy over speed
                .setLandmarkType(FirebaseVisionFaceDetectorOptions.ALL_LANDMARKS)             // detect eyes, ears, nose, mouth, cheeks
                .setClassificationType(FirebaseVisionFaceDetectorOptions.ALL_CLASSIFICATIONS) // smiling & eyes-open probabilities
                .setMinFaceSize(0.15f)       // minimum face size, relative to the image width
                .setTrackingEnabled(true)    // assign tracking IDs across frames
                .build();
INPUT FirebaseVisionImage

FirebaseVisionFaceDetector

OUTPUT List<FirebaseVisionFace>
➡ boundingBox: Rect
➡ trackingId: Int
➡ headEulerAngleY: Float
➡ headEulerAngleZ: Float
➡ smilingProbability: Float
➡ leftEyeOpenProbability: Float
➡ rightEyeOpenProbability: Float
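
A sketch (Kotlin) of running the detector with the options built above, assuming a FirebaseVisionImage named image and a log TAG; field names follow the output list above:

val detector = FirebaseVision.getInstance().getVisionFaceDetector(options)
detector.detectInImage(image)
    .addOnSuccessListener { faces ->
        for (face in faces) {
            val bounds = face.boundingBox
            val rotY = face.headEulerAngleY
            // probabilities are negative when the classifier could not compute them
            val smiling = face.smilingProbability
        }
    }
    .addOnFailureListener { e -> Log.e(TAG, "Face detection failed", e) }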
Feedback

• Real-time application: pay attention to the image size


/fr.xebia.mlkitinactions E/pittpatt: online_face_detector.cc:236] inconsistent image dimensions
detector.cc:220] inconsistent image dimensions
/fr.xebia.mlkitinactions E/NativeFaceDetectorImpl: Native face detection failed
java.lang.RuntimeException: Error detecting faces.
    at com.google.android.gms.vision.face.NativeFaceDetectorImpl.detectFacesJni(Native Method)
Image Labeling

On-device vs Cloud
• Pricing: on-device free; cloud free for the first 1000 uses of this feature per month
• Label coverage: on-device 400+ labels covering the most commonly found concepts in photos; cloud 10,000+ labels in many categories (try the Cloud Vision API demo to see which labels are returned for an image you provide)
• Knowledge Graph entity ID support: available both on-device and in the cloud
INPUT FirebaseVisionImage

FirebaseVisionLabelDetector

OUTPUT List<FirebaseVisionLabel>
🎒 ➡ label: String
➡ confidence: Float
➡ entityId: String
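
A sketch (Kotlin) of on-device labeling, again assuming a FirebaseVisionImage named image and a log TAG:

val detector = FirebaseVision.getInstance().visionLabelDetector
detector.detectInImage(image)
    .addOnSuccessListener { labels ->
        for (label in labels) {
            // e.g. "Fruit (0.92)" plus the Knowledge Graph entity ID
            Log.d(TAG, "${label.label} (${label.confidence}) ${label.entityId}")
        }
    }
    .addOnFailureListener { e -> Log.e(TAG, "Image labeling failed", e) }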
Landmark Recognition

• Still in preview (you can use the Cloud Vision API instead)
• Recognizes well-known landmarks
• Returns Google Knowledge Graph entity IDs
• Low-volume use is free (first 1000 images)
Custom Model
Some night towards the end of 2016
Android SDK (Java) + Android NDK (C++):
Camera preview -> (1) Image (Bitmap) -> Classifier implementation (Java) -> (2) input_tensor -> TensorFlow JNI wrapper + trained model (C++) -> (3) top_results -> (4) Classifications + confidence -> Overlay display

Ref: https://jalammar.github.io/Supercharging-android-apps-using-tensorflow/
Magritte
Ceci n'est pas une pomme. ("This is not an apple.")
Android Makers Paris, April 2017
Model Size

All weights are stored as-is (32-bit floats) => ~80 MB
Weights Quantization

• Example: 6.372638493746383 => 6.4
• Quantizing the 32-bit float weights to 8 bits shrinks the model from ~80 MB to ~20 MB

Source: https://www.tensorflow.org/performance/quantization
Model Inception V3
Optimized & Quantized
Google I/O, May 2017
Google AI Blog, June 2017
MobileNet
Mobile-first computer vision models for TensorFlow

Image credit: https://github.com/tensorflow/models/blob/master/research/slim/nets/mobilenet_v1.md

~80 MB => ~20 MB => ~1-5 MB

Source: https://research.googleblog.com/2017/06/mobilenets-open-source-models-for.html
DevFest Nantes, October 2017
Model MobileNets_0.25_224
Google I/O, May 2018
Custom Model: Key Capabilities

• TensorFlow Lite model hosting
• On-device ML inference
• Automatic model fallback
• Automatic model updates (see the registration sketch below)
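
A sketch (Kotlin) of how hosting, fallback and updates fit together, following the 2018 Firebase ML Kit custom-model API; the model names and the bundled file my_model.tflite are placeholders:

val conditions = FirebaseModelDownloadConditions.Builder()
    .requireWifi()                              // only download / update the hosted model over Wi-Fi
    .build()

val cloudSource = FirebaseCloudModelSource.Builder("my_model")
    .enableModelUpdates(true)                   // pick up newly published versions automatically
    .setInitialDownloadConditions(conditions)
    .setUpdatesDownloadConditions(conditions)
    .build()

val localSource = FirebaseLocalModelSource.Builder("my_model_local")
    .setAssetFilePath("my_model.tflite")        // fallback bundled in the APK
    .build()

FirebaseModelManager.getInstance().apply {
    registerCloudModelSource(cloudSource)
    registerLocalModelSource(localSource)
}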
1. Train your TF model (model.pb)
2. Convert the model to TF Lite (model.lite) with TOCO, the TensorFlow Lite Optimizing Converter
3. Host your TF Lite model on Firebase
4. Use the TF Lite model for inference
How to train your dragon model?
Train your model
python tensorflow/examples/image_retraining/retrain.py \
--bottleneck_dir=tf_files/bottlenecks \
--how_many_training_steps=500 \
--model_dir=tf_files/models/ \
--summaries_dir=tf_files/training_summaries/ \
--output_graph=tf_files/retrained_graph.pb \
--output_labels=tf_files/retrained_labels.txt \
--architecture=mobilenet_0.50_224 \
--image_dir=tf_files/fruit_photos

Source: https://codelabs.developers.google.com/codelabs/tensorflow-for-poets/
TF Lite conversion for a retrained quantized model is currently unavailable.
The Firebase ML Kit quickstart sample only targets quantized models.
Convert to tflite format
bazel run --config=opt \
//tensorflow/contrib/lite/toco:toco -- \
--input_file=/tmp/magritte_retrained_graph.pb \
--output_file=/tmp/magritte_graph.tflite \
--inference_type=FLOAT \
--input_shape=1,224,224,3 \
--input_array=input \
--output_array=final_result \
--mean_value=128 \
--std_value=128 \
--default_ranges_min=0 \
--default_ranges_max=6

Source: https://codelabs.developers.google.com/codelabs/tensorflow-for-poets/
Do you need custom models?
Use custom models if

• Your specific needs CANNOT be met by the general-purpose APIs
• You need high matching precision
• You are an experienced ML developer (or you know Yoann Benoit)

Let me train your model!
INPUT FirebaseModelInputs

FirebaseModelInterpreter

OUTPUT FirebaseModelOutputs
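
A sketch (Kotlin) of creating the interpreter, reusing the placeholder model names registered in the earlier sketch; the interpreter prefers the hosted model and falls back to the local one:

val modelOptions = FirebaseModelOptions.Builder()
    .setCloudModelName("my_model")
    .setLocalModelName("my_model_local")
    .build()
val interpreter = FirebaseModelInterpreter.getInstance(modelOptions)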
// input & output options for a non-quantized (float) model
val inputDims = intArrayOf(DIM_BATCH_SIZE, DIM_IMG_SIZE_X, DIM_IMG_SIZE_Y, DIM_PIXEL_SIZE)
val outputDims = intArrayOf(1, labelList.size)
inputOutputOptions = FirebaseModelInputOutputOptions.Builder()
    .setInputFormat(0, FirebaseModelDataType.FLOAT32, inputDims)
    .setOutputFormat(0, FirebaseModelDataType.FLOAT32, outputDims)
    .build()
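
A sketch (Kotlin) of running inference with the interpreter and the inputOutputOptions built above; imgData is a placeholder ByteBuffer holding 1 x 224 x 224 x 3 float pixel values:

val inputs = FirebaseModelInputs.Builder()
    .add(imgData)
    .build()

interpreter?.run(inputs, inputOutputOptions)
    ?.addOnSuccessListener { outputs ->
        val probabilities = outputs.getOutput<Array<FloatArray>>(0)
        // probabilities[0][i] is the confidence for labelList[i]
    }
    ?.addOnFailureListener { e -> Log.e(TAG, "Inference failed", e) }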
🥝 INPUT FirebaseModelInputs: a ByteBuffer of image pixels

OUTPUT FirebaseModelOutputs: one confidence score per label (🍎 🍌 🥝 🍊 🍓 🍇 🍉 🍋 🍍)
Performance Benchmarks
Model MobileNets_1.0_224
• No callback or other feedback during model downloading
• Model downloading seems to be blocking => do not trigger it on the main thread
• Lack of documentation at this point (e.g. how do you stop the interpreter?)
• Slight performance loss compared to using TensorFlow Lite directly
• A/B test your machine learning model!
HowTo: Face Recognition Model

• Trained with Keras + FaceNet


• Converted to TensorFlow
• Then converted to TensorFlow Lite
• Then we got stuck…
Custom TensorFlow Lite build
Custom TF Lite build

• ML Kit uses a pre-built TensorFlow Lite library
• You can build your own AAR with Bazel
• For example, to add custom ops
Takeaway
ML Kit: State of the Art

• Lack of high-quality demos (e.g. the Firebase ML Kit quickstart: bugs, deprecated Camera API, deformed camera preview)
• Lack of high-level guidelines / best practices
• Performance issues on older devices
The best is yet to come

• Face contours: 100 data points


• Smart Reply: conversation model
• Online model compression
References

• Magritte talk at DroidCon London
• Medium article: Android meets Machine Learning
• GitHub repo for the demo
• Joe Birch: Exploring Firebase MLKit on Android: Introducing MLKit (Part One)
• Joe Birch: Exploring Firebase MLKit on Android: Face Detection (Part Two)
• Thanks Sandra ;)
Questions?
