Learning to Cook – An Exploration of Recipe Data

Travis Arffa (tarffa), Rachel Lim (rachelim), and Jake Rachleff (jakerach)

Goals

We set out to solve two problems. First, we wanted to figure out the different “types” of recipes based purely on what ingredients were included, which would allow us to understand which ingredients are prevalent in which type of cuisine. Second, we wanted to predict recipe review scores based on recipe ingredients and real-valued features such as nutrition score and number of steps.

Data/Features

Data: We scraped all recipes currently on Epicurious.com for our data set. For each recipe, we scraped its ingredients, preparation steps, nutritional information, cook time, and user ratings (ranging 0-100). We filtered out recipes with fewer than 15 ratings, and collected 10,440 recipes in total. For clustering, we used all the recipe data. For prediction, we randomly partitioned the dataset into training and test sets constituting 80% and 20% of the recipes respectively.

Features: For Naive Bayes, clustering, and Random Forest, we used hand-curated binary feature vectors of ingredients in R^355. For linear regression we used simple features such as number of steps and number of ingredients, and we expanded the ingredient features for Naive Bayes as well.
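
The poster does not say which tools were used; as a rough, hypothetical illustration in Python, the binary ingredient features and the 80/20 split could be built along these lines (the recipe records and the ingredient vocabulary are assumptions, not the authors' code):

    import numpy as np
    from sklearn.model_selection import train_test_split

    def build_ingredient_features(recipes, vocabulary):
        """Encode each recipe as a binary vector over a fixed ingredient vocabulary."""
        index = {ing: i for i, ing in enumerate(vocabulary)}  # e.g. 355 curated ingredients
        X = np.zeros((len(recipes), len(vocabulary)))
        for row, recipe in enumerate(recipes):
            for ing in recipe["ingredients"]:
                if ing in index:
                    X[row, index[ing]] = 1.0
        return X

    # Hypothetical usage: `recipes` is a list of scraped dicts with "ingredients" and "rating".
    # X = build_ingredient_features(recipes, vocabulary)
    # y = np.array([r["rating"] for r in recipes])  # user ratings range 0-100
    # X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
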
(1) Clustering Recipes

Model: We sought to define a cuisine based solely on its ingredients, with no preconceived notions about cuisine itself. Thus, we found the unsupervised learning strategy of k-means clustering to be the best model for this task, which we could then verify against each recipe’s tags.
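
A minimal sketch of this step, assuming scikit-learn (the poster does not name a library): fit k-means for a range of k on the binary ingredient vectors and record the total squared error, then look for the inflection point.

    from sklearn.cluster import KMeans

    def cluster_errors(X, k_values):
        """Fit k-means for each k and return the total within-cluster squared error."""
        errors = {}
        for k in k_values:
            model = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X)
            errors[k] = model.inertia_  # sum of squared distances to assigned centroids
        return errors

    # e.g. cluster_errors(X, range(2, 11)); an elbow near k=3 suggests three clusters.
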

Results and discussion: Varying the number of clusters, we obtained a graph of total squared error [plot: total squared error vs. k, not reproduced here]. We see an inflection point around k=3, suggesting 3 as the optimal number of clusters.

The cluster plots [not reproduced here] show the clusters for k=3 and k=4. For k=3, inspecting the tags of recipes belonging to each cluster, we observe that these clusters correspond to meals, drinks, and desserts. We also observe an interesting trend: as we increased the number of clusters, these recipe classes were split further into natural subclasses. When k increases from 3 to 4, the cluster corresponding to ‘meals’ (in purple) is split into Asian and European cuisines. For each increase of k past the kink, we still discover new cuisine types with similar flavors based on their tags and ingredients, meaning that our most informative cluster sizes were not dependent on the inflection of the cluster error graph.

(2) Recipe Rating Prediction

To learn the quality of a recipe (measured by its rating), we tried several different machine learning models, including linear regression, locally weighted linear regression, Naive Bayes, and Random Forest. We discuss the latter two models in depth.

Naive Bayes

Model: Naive Bayes is a probabilistic model for classification that assumes the occurrence of features is conditionally independent given the class variable. While this assumption does not hold in the case of recipes, it is a good baseline model for prediction.

Data: We discretized the ratings into evenly-sized buckets and performed multiclass Naive Bayes classification, experimenting with the number of buckets and the feature type (an R^355 binary feature vector of ingredients, and an R^126025 binary feature vector of paired ingredients).

Results and discussion: Prediction works better with pairwise ingredient features. We expect this to be the case, since the conditional independence assumption does not hold for recipe ingredients, and ingredients tend to “go well together”.

Mean Absolute Error
                                  Number of buckets
                                    10      20
  Basic ingredient features        8.41    9.42
  Pairwise ingredient features     7.78    8.84
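
A hedged sketch of this setup (not necessarily the authors' implementation): discretize the ratings into equal-width buckets and fit a Bernoulli Naive Bayes over the binary ingredient or paired-ingredient features. The helpers below, and the reuse of X_train/y_train from the earlier sketch, are illustrative assumptions.

    import numpy as np
    from sklearn.naive_bayes import BernoulliNB

    def bucketize(ratings, n_buckets=10):
        """Map ratings in [0, 100] to integer bucket labels of equal width."""
        edges = np.linspace(0, 100, n_buckets + 1)
        return np.digitize(ratings, edges[1:-1])  # labels 0 .. n_buckets-1

    def pairwise_features(X):
        """Binary indicators for co-occurring ingredient pairs (flattened outer product)."""
        # 355 x 355 pairs flattened -> the R^126025 vector mentioned on the poster.
        # Dense here for brevity; at ~10k recipes a sparse matrix is advisable.
        return np.einsum("ni,nj->nij", X, X).reshape(X.shape[0], -1)

    # clf = BernoulliNB().fit(pairwise_features(X_train), bucketize(y_train, n_buckets=10))
    # pred_buckets = clf.predict(pairwise_features(X_test))
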
Random Forest

Model: Random Forest uses randomized samples of data to fit several smaller regression trees. It outputs a prediction that is the average output of each tree. Randomization reduces correlation between trees, and the use of multiple trees counteracts overfitting. We chose RF due to the large number of ingredients and the potential to overfit to ingredients that occur frequently in the training data.

                      Full    Reduced
  Min. Leaf Size        50        5
  Num. Trees           100       10
  Num. Predictors      355       15
  Sub-Sample Size      200      200
  Abs. Test Error      6.07     6.08

Results: RF had an average absolute test error of around 6.08 and an MSE of 86.5. RF outperformed the other regression techniques that we attempted, including linear regression and locally weighted linear regression.

Discussion: The test errors of the two models indicate that additional features and tree complexity did not yield more accurate predictions. Fifteen predictors were chosen for the reduced model based on the out-of-bag variable importance, which equals the average difference between the outputs of trees that included the feature and those that did not. The sub-sample size was kept constant to account for the sparsity of the feature vectors.
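
The poster does not name its random-forest implementation; the sketch below is a rough scikit-learn analogue of the full and reduced configurations, with impurity-based importances standing in for out-of-bag variable importance. It reuses the hypothetical X_train/X_test split from the earlier sketch.

    import numpy as np
    from sklearn.ensemble import RandomForestRegressor
    from sklearn.metrics import mean_absolute_error

    # Full model: 100 trees, minimum leaf size 50, sub-samples of 200 recipes per tree.
    full_rf = RandomForestRegressor(n_estimators=100, min_samples_leaf=50,
                                    max_samples=200, random_state=0).fit(X_train, y_train)

    # Reduced model: keep the 15 most important ingredient features, refit a smaller forest.
    top15 = np.argsort(full_rf.feature_importances_)[-15:]
    reduced_rf = RandomForestRegressor(n_estimators=10, min_samples_leaf=5,
                                       max_samples=200, random_state=0).fit(X_train[:, top15], y_train)

    print("full MAE:   ", mean_absolute_error(y_test, full_rf.predict(X_test)))
    print("reduced MAE:", mean_absolute_error(y_test, reduced_rf.predict(X_test[:, top15])))
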
Future Research

The next step for our project would be to auto-generate recipes using the clustered tags while striving to maximize ratings. This would represent a combination of the supervised and unsupervised techniques currently presented, as well as additional modeling to account for varying amounts of each ingredient. Another interesting avenue of research would be to look at how the ingredients, and the amounts of each ingredient, correspond to nutritional value.
