I spoke at the first Kaizen Data Science Conference, San Francisco, Sep 2016 on one of Instacart's recommendation systems. Also covers innovative ways of using data science to solve interdisciplinary problems. - Sharath Rao
Learned Embeddings for Search and Discovery at InstacartSharath Rao
Learned word embeddings such as Word2vec/Glove were initially found to be effective for broad range of tasks in Natural Language Processing (NLP). More recently though, these are being used successfully in areas well beyond text such as graphs and event streams. In this talk Sharath will speak about how we use learned embeddings at Instacart for search ranking, personalization and product recommendations.
Presented at: SF Data Mining Meetup https://www.meetup.com/Data-Mining/events/237164197/
DataEngConf SF16 - Recommendations at InstacartHakka Labs
The document discusses recommendations at Instacart, an online grocery delivery service. It summarizes:
1) Instacart aims to provide personalized top N recommendations to promote discovery across its large and dynamic catalog of grocery products from various stores.
2) It also provides replacement product recommendations to help shoppers find substitutes when items are out of stock, drawing on models trained on product attributes and user purchase histories.
3) Additional recommendation types discussed include "frequently bought together" items and post-checkout suggestions to accommodate last-minute additions. The document outlines Instacart's recommendation system architecture and evaluation approach.
Instacart has revolutionized grocery shopping by bringing groceries to your door in a little as an hour. Behind the scenes, Instacart uses machine learning for everything from routing shoppers to ranking search results. In this talk, Jeremy will cover their recent tech blog post, Deep Learning with Emojis (not Math) ( https://tech.instacart.com/deep-learning-with-emojis-not-math-660ba1ad6cdc ), which details how Instacart is using Keras and Tensorflow to predict the sequence that shoppers will pick items in stores. Jeremy will discuss the data collection, mobile technology and deep learning architectures Instacart is applying to enable on-demand grocery delivery.
Overview of the Recommender system or recommendation system. RFM Concepts in brief. Collaborative Filtering in Item and User based. Content-based Recommendation also described.Product Association Recommender System. Stereotype Recommendation described with advantage and limitations.Customer Lifetime. Recommender System Analysis and Solving Cycle.
A Multi-Armed Bandit Framework For Recommendations at NetflixJaya Kawale
In this talk, we present a general multi-armed bandit framework for recommendations on the Netflix homepage. We present two example case studies using MABs at Netflix - a) Artwork Personalization to recommend personalized visuals for each of our members for the different titles and b) Billboard recommendation to recommend the right title to be watched on the Billboard.
This document discusses building a recommendation system for e-commerce. It begins by noting the importance of recommendations, with over 30% of online purchases coming from recommendations. It then discusses gathering data, both explicitly via ratings and reviews, and implicitly via user actions. Main approaches covered include content-based filtering, collaborative filtering using user-user and item-item similarities, and matrix factorization. The document also addresses challenges like sparsity, cold starts, scalability and privacy considerations in implementing recommendation systems.
Product Management and Metrics by Amazon Sr PMProduct School
This document discusses product management and metrics. It outlines certificate programs for product management from Product School and offers corporate training. Key takeaways include an overview of the product manager role and importance of metrics for product improvement. Common metrics frameworks like AARRR and HEART are explained. Examples of metrics used at Amazon and PayPal are provided. The document encourages continuing to learn about product metrics through various resources.
Recommender system algorithm and architectureLiang Xiang
1) The document discusses recommender system algorithms and architecture. It covers common recommendation techniques like collaborative filtering, content-based filtering, and graph-based recommendations.
2) It also discusses challenges like cold starts for new users and items. For new users, it recommends using demographic data or initial feedback to understand interests. For new items, it suggests using content information or initial user feedback.
3) The document proposes a feature-based recommendation framework that connects users, items, and latent features to address challenges like heterogeneous data and cold starts. This framework provides explanations but does not support user-based methods.
What really are recommendations engines nowadays?
This presentation introduces the foundations of recommendation algorithms, and covers common approaches as well as some of the most advanced techniques. Although more focused on efficiency than theoretical properties, basics of matrix algebra and optimization-based machine learning are used through the presentation.
Table of Contents:
1. Collaborative Filtering
1.1 User-User
1.2 Item-Item
1.3 User-Item
* Matrix Factorization
* Stochastic Gradient Descent (SGD)
* Truncated Singular Value Decomposition (SVD)
* Alternating Least Square (ALS)
* Deep Learning
2. Content Extraction
* Item-Item Similarities
* Deep Content Extraction: NLP, CNN, LSTM
3. Hybrid Models
4. In Production
4.1 Problematics
4.2 Solutions
4.3 Tools
It contains detail description about recommendation system which is a part of machine learning algorithms. It contains diagrams and examples for better understanding and description purpose with to the point detail matter.
Recommender Systems from A to Z – Model TrainingCrossing Minds
This second meetup will be about training different models for our recommender system. We will review the simple models we can build as a baseline. After that, we will present the recommender system as an optimization problem and discuss different training losses. We will mention linear models and matrix factorization techniques. We will end the presentation with a simple introduction to non-linear models and deep learning.
Applied Machine Learning for Ranking Products in an Ecommerce SettingDatabricks
As a leading e-commerce company in fashion in the Netherlands, Wehkamp dedicates itself to provide a better shopping experience for the customers. Using Spark, the data science team is able to develop various machine-learning projects for this purpose based on the large scale data of products and customers. A major topic for the data science team is ranking products. If a visitor enters a search phrase, what are the best products that fit the search phrase and in what order should the products been shown? Ranking products is also important if a visitor enters a product overview page, where hundreds or even thousands of products of a certain article type are displayed.
In this project, Spark is used in the whole pipeline: retrieving and processing the search phrases and their results, making click models, creating feature sets, training and evaluating ranking models, pushing the models to production using ElasticSearch and creating Tableau dashboarding. In this talk, we are going to demonstrate how we use Spark to build up the whole pipeline of ranking products and the challenges we faced along the way.
The document provides an overview of recommender systems. It discusses the typical architecture of recommender systems and describes three main types: collaborative filtering systems, content-based systems, and knowledge-based systems. It also covers paradigms like collaborative filtering, content-based, knowledge-based, and hybrid recommender systems. The document then focuses on collaborative filtering techniques like user-based nearest neighbor collaborative filtering and item-based collaborative filtering. It also discusses latent factor models, matrix factorization approaches, and context-based recommender systems.
basic Function and Terminology of Recommendation Systems. Some Algorithmic Implementation with some sample Dataset for Understanding. It contains all the Layers of RS Framework well explained.
Recommender Systems represent one of the most widespread and impactful applications of predictive machine learning models.
Amazon, YouTube, Netflix, Facebook and many other companies generate an important fraction of their revenues thanks to their ability to model and accurately predict users ratings and preferences.
In this presentation we cover the following points:
→ introduction to recommender systems
→ working with explicit vs implicit feedback
→ content-based vs collaborative filtering approaches
→ user-based and item-item methods
→ machine learning and deep learning models
→ pros & cons of the methods: scalability, accuracy, explainability
Part of the course "Algorithmic Methods of Data Science". Sapienza University of Rome, 2015.
http://aris.me/index.php/data-mining-ds-2015
This document provides an overview of recommendation engines and systems. It describes different types of recommendation approaches, including collaborative filtering, content-based filtering, and hybrid methods. It also discusses how recommendation algorithms work and are implemented in Apache Mahout, a machine learning library for developing scalable recommendation applications. Key recommendation techniques like item-based filtering and user-based filtering are explained.
Deep Learning for Personalized Search and Recommender SystemsBenjamin Le
Slide deck presented for a tutorial at KDD2017.
https://engineering.linkedin.com/data/publications/kdd-2017/deep-learning-tutorial
Join one of our in-house experts as we walk through best practices for Amazon Sponsored Brands and Sponsored Products ads. We review tactics to drive ROI as well as how to avoid common pitfalls we see vendors make.
Anatomy of an eCommerce Search Engine by Mayur DatarNaresh Jain
This document discusses search architecture and optimization for e-commerce platforms. It describes how search is a critical feature that powers recommendations and sales. Key challenges include large catalogs that change frequently, diverse user needs like geo-specific ranking, and balancing multiple objectives. The document outlines the technical infrastructure supporting search, including serving architecture, indexing workflows, and approaches to improve quality like query understanding and personalization.
Interactive Recommender Systems with Netflix and SpotifyChris Johnson
Interactive recommender systems enable the user to steer the received recommendations in the desired direction through explicit interaction with the system. In the larger ecosystem of recommender systems used on a website, it is positioned between a lean-back recommendation experience and an active search for a specific piece of content. Besides this aspect, we will discuss several parts that are especially important for interactive recommender systems, including the following: design of the user interface and its tight integration with the algorithm in the back-end; computational efficiency of the recommender algorithm; as well as choosing the right balance between exploiting the feedback from the user as to provide relevant recommendations, and enabling the user to explore the catalog and steer the recommendations in the desired direction.
In particular, we will explore the field of interactive video and music recommendations and their application at Netflix and Spotify. We outline some of the user-experiences built, and discuss the approaches followed to tackle the various aspects of interactive recommendations. We present our insights from user studies and A/B tests.
The tutorial targets researchers and practitioners in the field of recommender systems, and will give the participants a unique opportunity to learn about the various aspects of interactive recommender systems in the video and music domain. The tutorial assumes familiarity with the common methods of recommender systems.
At Netflix, we try to provide the best personalized video recommendations to our members. To do this, we need to adapt our recommendations for each contextual situation, which depends on information such as time or device. In this talk, I will describe how state of the art Contextual Recommendations are used at Netflix. A first example of contextual adaptation is the model that powers the Continue Watching row. It uses a feature-based approach with a carefully constructed training set to learn how to adapt to the context of the member. Next, I will dive into more modern approaches such as Tensor Factorization and LSTMs and share some results from deployments of these methods. I will highlight lessons learned and some common pitfalls of using these powerful methods in industrial scale systems. Finally, I will touch upon system reliability, choice of optimization metrics, hidden costs, risks and benefits of using highly adaptive systems.
Recommender systems are software agents that analyze a user's preferences through transactions and provide personalized recommendations accordingly. There are several recommendation paradigms including non-personalized rules, personalized rules based on user data, and transaction-based collaborative filtering that learns from user interactions. Context-based recommender systems also consider additional information like time, location, or device to provide adaptive recommendations. Common techniques used in recommender systems include content-based filtering that recommends similar items, collaborative filtering that finds users with similar tastes, and demographic-based recommendations.
This document analyzes the business model and financials of Instacart, a grocery delivery startup. It summarizes that Instacart's revenues grew from $700,000 in 2012 to over $100 million in 2014 using a model where they partner with retailers but do not operate warehouses or fleets themselves. Porter's Five Forces and SWOT analyses are presented to assess the sustainability of Instacart's model and competitive environment. The conclusion recommends Instacart focus on maintaining high margins, expanding their shopper network, and partnering with more retailers to exploit their first mover advantage.
PSFK Future of Retail 2015 Report - Summary PresentationPSFK
Get your copy of The Future of Retail 2015: www.psfk.com/report/future-of-retail-2015
In the fifth volume of the Future of Retail report the PSFK Labs team explores the dynamic social, technological, and physical forces influencing consumer behavior and driving next-generation shopping experiences. With a refocus on the importance of the physical store, our analysis below includes 10 in-store strategies supported by over a dozen key trends that retailers can use to immediately begin redefining their retail experience.
The report looks at how, in order to stand out from the competition, retailers and brands must make the best use of their customers’ time and attention by designing multichannel experiences that strike a perfect balance between efficiency and enjoyment, relevance and surprise.
Featured within the 110 page report, readers can find:
- 10 strategies to redefine the store
- Over a dozen global trends changing retail
- 20 future store concepts
- Perspectives from leading shopper experts across the globe
If you are interested in seeing a presentation of this report or would like to understand how PSFK can help your team ideate new possibilities for your brand, contact us at [email protected]
Vol. 5 | Published November 2014
All rights reserved. No parts of this publication may be reproduced without the written permission of PSFK Labs.
Recommender system algorithm and architectureLiang Xiang
1) The document discusses recommender system algorithms and architecture. It covers common recommendation techniques like collaborative filtering, content-based filtering, and graph-based recommendations.
2) It also discusses challenges like cold starts for new users and items. For new users, it recommends using demographic data or initial feedback to understand interests. For new items, it suggests using content information or initial user feedback.
3) The document proposes a feature-based recommendation framework that connects users, items, and latent features to address challenges like heterogeneous data and cold starts. This framework provides explanations but does not support user-based methods.
What really are recommendations engines nowadays?
This presentation introduces the foundations of recommendation algorithms, and covers common approaches as well as some of the most advanced techniques. Although more focused on efficiency than theoretical properties, basics of matrix algebra and optimization-based machine learning are used through the presentation.
Table of Contents:
1. Collaborative Filtering
1.1 User-User
1.2 Item-Item
1.3 User-Item
* Matrix Factorization
* Stochastic Gradient Descent (SGD)
* Truncated Singular Value Decomposition (SVD)
* Alternating Least Square (ALS)
* Deep Learning
2. Content Extraction
* Item-Item Similarities
* Deep Content Extraction: NLP, CNN, LSTM
3. Hybrid Models
4. In Production
4.1 Problematics
4.2 Solutions
4.3 Tools
It contains detail description about recommendation system which is a part of machine learning algorithms. It contains diagrams and examples for better understanding and description purpose with to the point detail matter.
Recommender Systems from A to Z – Model TrainingCrossing Minds
This second meetup will be about training different models for our recommender system. We will review the simple models we can build as a baseline. After that, we will present the recommender system as an optimization problem and discuss different training losses. We will mention linear models and matrix factorization techniques. We will end the presentation with a simple introduction to non-linear models and deep learning.
Applied Machine Learning for Ranking Products in an Ecommerce SettingDatabricks
As a leading e-commerce company in fashion in the Netherlands, Wehkamp dedicates itself to provide a better shopping experience for the customers. Using Spark, the data science team is able to develop various machine-learning projects for this purpose based on the large scale data of products and customers. A major topic for the data science team is ranking products. If a visitor enters a search phrase, what are the best products that fit the search phrase and in what order should the products been shown? Ranking products is also important if a visitor enters a product overview page, where hundreds or even thousands of products of a certain article type are displayed.
In this project, Spark is used in the whole pipeline: retrieving and processing the search phrases and their results, making click models, creating feature sets, training and evaluating ranking models, pushing the models to production using ElasticSearch and creating Tableau dashboarding. In this talk, we are going to demonstrate how we use Spark to build up the whole pipeline of ranking products and the challenges we faced along the way.
The document provides an overview of recommender systems. It discusses the typical architecture of recommender systems and describes three main types: collaborative filtering systems, content-based systems, and knowledge-based systems. It also covers paradigms like collaborative filtering, content-based, knowledge-based, and hybrid recommender systems. The document then focuses on collaborative filtering techniques like user-based nearest neighbor collaborative filtering and item-based collaborative filtering. It also discusses latent factor models, matrix factorization approaches, and context-based recommender systems.
basic Function and Terminology of Recommendation Systems. Some Algorithmic Implementation with some sample Dataset for Understanding. It contains all the Layers of RS Framework well explained.
Recommender Systems represent one of the most widespread and impactful applications of predictive machine learning models.
Amazon, YouTube, Netflix, Facebook and many other companies generate an important fraction of their revenues thanks to their ability to model and accurately predict users ratings and preferences.
In this presentation we cover the following points:
→ introduction to recommender systems
→ working with explicit vs implicit feedback
→ content-based vs collaborative filtering approaches
→ user-based and item-item methods
→ machine learning and deep learning models
→ pros & cons of the methods: scalability, accuracy, explainability
Part of the course "Algorithmic Methods of Data Science". Sapienza University of Rome, 2015.
http://aris.me/index.php/data-mining-ds-2015
This document provides an overview of recommendation engines and systems. It describes different types of recommendation approaches, including collaborative filtering, content-based filtering, and hybrid methods. It also discusses how recommendation algorithms work and are implemented in Apache Mahout, a machine learning library for developing scalable recommendation applications. Key recommendation techniques like item-based filtering and user-based filtering are explained.
Deep Learning for Personalized Search and Recommender SystemsBenjamin Le
Slide deck presented for a tutorial at KDD2017.
https://engineering.linkedin.com/data/publications/kdd-2017/deep-learning-tutorial
Join one of our in-house experts as we walk through best practices for Amazon Sponsored Brands and Sponsored Products ads. We review tactics to drive ROI as well as how to avoid common pitfalls we see vendors make.
Anatomy of an eCommerce Search Engine by Mayur DatarNaresh Jain
This document discusses search architecture and optimization for e-commerce platforms. It describes how search is a critical feature that powers recommendations and sales. Key challenges include large catalogs that change frequently, diverse user needs like geo-specific ranking, and balancing multiple objectives. The document outlines the technical infrastructure supporting search, including serving architecture, indexing workflows, and approaches to improve quality like query understanding and personalization.
Interactive Recommender Systems with Netflix and SpotifyChris Johnson
Interactive recommender systems enable the user to steer the received recommendations in the desired direction through explicit interaction with the system. In the larger ecosystem of recommender systems used on a website, it is positioned between a lean-back recommendation experience and an active search for a specific piece of content. Besides this aspect, we will discuss several parts that are especially important for interactive recommender systems, including the following: design of the user interface and its tight integration with the algorithm in the back-end; computational efficiency of the recommender algorithm; as well as choosing the right balance between exploiting the feedback from the user as to provide relevant recommendations, and enabling the user to explore the catalog and steer the recommendations in the desired direction.
In particular, we will explore the field of interactive video and music recommendations and their application at Netflix and Spotify. We outline some of the user-experiences built, and discuss the approaches followed to tackle the various aspects of interactive recommendations. We present our insights from user studies and A/B tests.
The tutorial targets researchers and practitioners in the field of recommender systems, and will give the participants a unique opportunity to learn about the various aspects of interactive recommender systems in the video and music domain. The tutorial assumes familiarity with the common methods of recommender systems.
At Netflix, we try to provide the best personalized video recommendations to our members. To do this, we need to adapt our recommendations for each contextual situation, which depends on information such as time or device. In this talk, I will describe how state of the art Contextual Recommendations are used at Netflix. A first example of contextual adaptation is the model that powers the Continue Watching row. It uses a feature-based approach with a carefully constructed training set to learn how to adapt to the context of the member. Next, I will dive into more modern approaches such as Tensor Factorization and LSTMs and share some results from deployments of these methods. I will highlight lessons learned and some common pitfalls of using these powerful methods in industrial scale systems. Finally, I will touch upon system reliability, choice of optimization metrics, hidden costs, risks and benefits of using highly adaptive systems.
Recommender systems are software agents that analyze a user's preferences through transactions and provide personalized recommendations accordingly. There are several recommendation paradigms including non-personalized rules, personalized rules based on user data, and transaction-based collaborative filtering that learns from user interactions. Context-based recommender systems also consider additional information like time, location, or device to provide adaptive recommendations. Common techniques used in recommender systems include content-based filtering that recommends similar items, collaborative filtering that finds users with similar tastes, and demographic-based recommendations.
This document analyzes the business model and financials of Instacart, a grocery delivery startup. It summarizes that Instacart's revenues grew from $700,000 in 2012 to over $100 million in 2014 using a model where they partner with retailers but do not operate warehouses or fleets themselves. Porter's Five Forces and SWOT analyses are presented to assess the sustainability of Instacart's model and competitive environment. The conclusion recommends Instacart focus on maintaining high margins, expanding their shopper network, and partnering with more retailers to exploit their first mover advantage.
PSFK Future of Retail 2015 Report - Summary PresentationPSFK
Get your copy of The Future of Retail 2015: www.psfk.com/report/future-of-retail-2015
In the fifth volume of the Future of Retail report the PSFK Labs team explores the dynamic social, technological, and physical forces influencing consumer behavior and driving next-generation shopping experiences. With a refocus on the importance of the physical store, our analysis below includes 10 in-store strategies supported by over a dozen key trends that retailers can use to immediately begin redefining their retail experience.
The report looks at how, in order to stand out from the competition, retailers and brands must make the best use of their customers’ time and attention by designing multichannel experiences that strike a perfect balance between efficiency and enjoyment, relevance and surprise.
Featured within the 110 page report, readers can find:
- 10 strategies to redefine the store
- Over a dozen global trends changing retail
- 20 future store concepts
- Perspectives from leading shopper experts across the globe
If you are interested in seeing a presentation of this report or would like to understand how PSFK can help your team ideate new possibilities for your brand, contact us at [email protected]
Vol. 5 | Published November 2014
All rights reserved. No parts of this publication may be reproduced without the written permission of PSFK Labs.
My talk at DataEngConf, on Instacart's first discovery based recommendation systems that uses machine learning methods on large scale datasets in online grocery.
Kimchi Ho is a community-minded designer looking for an opportunity to do great work. She has strong creative, project management, and communication skills developed over a career in architecture and design. Her experience includes roles at sustainability firms, condo developers, and landscape architecture practices. She is passionate about entrepreneurship and using her skills and network to make positive change.
Dr. Martin Luther King Jr. was America's preeminent advocate of nonviolence who led the civil rights movement in the late 1950s and 1960s to achieve legal equality for African Americans, drawing inspiration from his Christian faith and the teachings of Mahatma Gandhi to pursue nonviolent protest.
Amazon is considering launching a new grocery delivery service called Amazon Fresh. The key points from the document are:
1) The online grocery market is estimated to be $7 billion, but adoption rates are difficult to predict and will be the biggest challenge.
2) Profitability in online grocery is low due to intense competition and low grocery industry margins. Amazon Fresh would not be profitable for several years.
3) Top competitors include Walmart, Instacart, and traditional grocers moving online. Walmart and Instacart present the greatest threats due to price, delivery speed, and selection.
Download a full version of the report at:
www.psfk.com/report/future-of-retail-2016
Built on a robust study of trends and patterns in the market, the 6th edition of PSFK Labs’ Future of Retail report offers a directional playbook for brands and retailers – defining 10 pillars to build a modern and engaging shopper experience strategy and go beyond expectations to create an enhanced shopper experience and therefore, build value, drive sales, and boost loyalty.
Featured within the 80+ page report, readers can find:
- 10 actions every retailer can adapt to redefine the shopper experience
- 20 key trends driving change in the marketplace
- Future service concepts for top brands
- Perspectives from leading retail experts across the globe
If you are interested in seeing a presentation of this report or would like to understand how PSFK can help your team ideate new possibilities for your brand, contact us at [email protected]
Vol. 6 | Published November 2015
All rights reserved. No parts of this publication may be reproduced without the written permission of PSFK Labs.
Solving for X: Why the Future of Business is ExperientialBrian Solis
Products don't define a brand, experiences do. Brian Solis explains why companies must shift from product-centric strategies to cultivating outstanding experiences to remain competitive. By Judith Aquino, TeleTech.
Every startup begins with an idea. This is a talk on how to come up with startup ideas and how to use validation to pick the ones worth working on. It's based on the book "Hello, Startup" (http://www.hello-startup.net/). You can find the video of the talk here: https://www.youtube.com/watch?v=GkmiE8d_5Pw
WrangleConf 2017 - Lessons from Integrating ML models into Data ProductsSharath Rao
Lessons From Integrating Machine Learning Models Into Data Products
This document discusses lessons learned from integrating machine learning models into production at Instacart. It explains that as products grow more complex, models need to be adapted to share inputs, have model outputs serve as inputs to other models, and meet scale and service level agreements. Key considerations for operationalizing models include scoring latency, how much models benefit from contextual or real-time data, and whether models are best suited for real-time or batch scoring. The document provides examples of machine learning use cases at Instacart and questions to consider about model integration.
Market basket analysis examines how products are purchased together by analyzing customer transaction data. It identifies associations between products that are frequently purchased together. These associations can be represented as rules, such as "customers who buy pasta and cheese also tend to buy olive oil." Retailers can use these rules to inform decisions around product placement, promotions, and bundling. The strength of an association is measured by its support, confidence, and lift. Sequential analysis extends this technique to consider the order items were purchased in.
The document discusses how beacon analytics and algorithms can be used in retail to provide benefits like targeted product recommendations and optimized supply chain management. It provides examples of different recommendation algorithms like collaborative filtering that make suggestions based on users with similar behaviors. Analytics can also help with supply chain planning, procurement spending analysis, inventory management, and other key performance areas. Specific metrics that can be analyzed are outlined like order fill rates, late shipments, supplier performance, inventory turns, and more.
DataEngConf 2017 - Machine Learning Models in ProductionSharath Rao
- Integrating machine learning models into customer workflows
- Economies of scope with data products
- Using a shared features store for reusing features across models
Lessons From Integrating Machine Learning into Data Products | Wrangle Confer...Cloudera, Inc.
In this talk, we will share practical lessons and patterns for building machine learning (ML) models in production, based on our experience with search ranking and recommendation systems at Instacart. As part of this I will include a detailed discussion on the technical challenges in building a ML features pipeline, one of which is now shared across multiple data products at Instacart.
Webinar: Increase Conversion With Better SearchLucidworks
This document discusses a partnership between IBM Commerce and Lucidworks to improve e-commerce search experiences. Key points:
1. The partnership will integrate Lucidworks' Apache Solr-based search platform Fusion with IBM Commerce to power search and recommendations on IBM Commerce sites.
2. Fusion will enrich product content, queries, and results to improve findability. It will also use signals from user interactions for more relevant results and personalized recommendations.
3. The integration aims to improve customer experiences and conversions by ensuring customers can find products through various query types and discover related items to buy.
Winning Supply Chain in Omnichannel - Trends and ImplicationsMichael Hu
I gave a talk at Professor Chopra's class at Kellogg on emerging trends in omnichannel retailing and the need for new supply chain and fulfillment models.
The document discusses how companies can use data and experimentation to improve consumer products and business metrics. It recommends that companies (1) collect extensive customer data, (2) instrument all customer touchpoints to measure key metrics, and (3) analyze the data to identify correlations and opportunities for improvement. Regular experimentation is important to continuously innovate and optimize the customer experience.
The document discusses how supply chain analytics can help organizations optimize their supply chain operations. It describes how the changing role of consumers has impacted supply chains and the need for collaboration, visibility, and efficiency across the supply chain. It then provides examples of different types of supply chain analytics and insights organizations can gain in areas like executive dashboards, supply chain design, demand forecasting, pricing, inventory management, and more. It also provides a brief case study of how Team Computers implemented an analytics solution for Parle Products to track stock levels, sales, and shortfalls.
Find out how retailers can utilize 2014 purchase data to predict 2015 holiday buying.
Highlights include:
-Original research from "Turning 2014 Holiday Trends into 2015 Revenue" by Oracle Marketing Cloud and Edison Research
-Holiday retention strategies from Windsor Circle
-Successful holiday campaign examples from Artbeads
Big data certification training mumbaiTejaspathiLV
ExcelR offers 160 hours classroom training on Business Analytics / Data Scientist / Data Analytics. We are considered as one of the best training institutes on Business Analytics in Mumbai. “Faculty and vast course agenda is our differentiator”. The training is conducted by alumni of premier institutions such as IIT & ISB who has extensive experience in the arena of analytics. They are considered to be one of the best trainers in the industry. The topics covered as part of this Data Scientist Certification program is on par with most of the Master of Science in Analytics (MS in Business Analytics / MS in Data Analytics) programs across the top-notch universities of the globe.
ExcelR is a proud partner of Universiti Malaysia Saravak (UNIMAS), Malaysia’s 1st public University and ranked 8th top university in Malaysia and ranked among top 200th in Asian University Rankings 2017 by QS World University Rankings. Participants will be awarded Data Science international certification from UNIMAS.
ExcelR is a proud partner of Universiti Malaysia Saravak (UNIMAS), Malaysia’s 1st public University and ranked 8th top university in Malaysia and ranked among top 200th in Asian University Rankings 2017 by QS World University Rankings. Participants will be awarded Data Science international certification from UNIMAS.
ExcelR is the best Data Science training institute in Hyderabad which offers services from training to placement as part of the Data Science training program with over 400+ participants placed in various multinational companies including E&Y, Panasonic, Accenture, VMWare, Infosys, IBM, etc….and the staff is from NIT’s & IIT’s
The document discusses strategies for achieving an agile supply chain. It defines three types of supply chains - traditional, lean, and agile. An agile supply chain is well-suited for innovative products with unpredictable demand. The characteristics of an agile supply chain include flexibility, market sensitivity through a virtual network, postponement, and applying some lean principles upstream of the decoupling point. Rules for achieving agility include accepting uncertainty, reducing it through better forecasting, avoiding it through flexibility and short lead times, and hedging against it with buffers. The case of Zara demonstrates market sensitivity, postponement, and achieving flexibility through a hybrid lean-agile approach.
Rishabh Misra, Mengting Wan, Julian McAuley, “Decomposing Fit Semantics for Product Size Recommendation in Metric Spaces”, in Proceedings of 2018 ACM Conference on Recommender Systems (RecSys’18), Vancouver, Canada, Oct. 2018
Predictive Analytics for Customer Targeting: A Telemarketing Banking ExamplePedro Ecija Serrano
This document discusses using predictive analytics and machine learning models to identify customers likely to purchase bank deposits. It tests various techniques including oversampling, undersampling, and generating synthetic data to address class imbalance in the dataset. Models tested include naive Bayes, support vector machines, decision trees, and ensembles. The best performing techniques were under sampling naive Bayes and support vector machines, predicting over 60% of buyers with around 25% of calls. Key factors identified for predicting purchases included customer contact history, economic conditions, time of year, and demographics.
This chapter discusses creating long term loyalty relationships with customers. It covers building customer value and satisfaction, determining factors of customer perceived value, customer loyalty, satisfaction, quality, and profitability. It also discusses customer lifetime value equations, cultivating customer relationships through customer relationship management best practices, using customer databases for marketing, and potential downsides.
Manthan is one of the best Restaurant Analytics Software Company in US which provides comprehensive AI-powered solution that addresses every need of the contemporary restaurant chain. With Customer Analytics for restaurant marketing, targeting and personalization, Demand Analytics for identifying opportunities and Operational Analytics for day-to-day management.
APNIC -Policy Development Process, presented at Local APIGA Taiwan 2025APNIC
Joyce Chen, Senior Advisor, Strategic Engagement at APNIC, presented on 'APNIC Policy Development Process' at the Local APIGA Taiwan 2025 event held in Taipei from 19 to 20 April 2025.
APNIC Update, presented at NZNOG 2025 by Terry SweetserAPNIC
Terry Sweetser, Training Delivery Manager (South Asia & Oceania) at APNIC presented an APNIC update at NZNOG 2025 held in Napier, New Zealand from 9 to 11 April 2025.
DNS Resolvers and Nameservers (in New Zealand)APNIC
Geoff Huston, Chief Scientist at APNIC, presented on 'DNS Resolvers and Nameservers in New Zealand' at NZNOG 2025 held in Napier, New Zealand from 9 to 11 April 2025.
Best web hosting Vancouver 2025 for you businesssteve198109
Vancouver in 2025 is more than scenic views, yoga studios, and oat milk lattes—it’s a thriving hub for eco-conscious entrepreneurs looking to make a real difference. If you’ve ever dreamed of launching a purpose-driven business, now is the time. Whether it’s urban mushroom farming, upcycled furniture sales, or vegan skincare sold online, your green idea deserves a strong digital foundation.
The 2025 Canadian eCommerce landscape is being shaped by trends like sustainability, local innovation, and consumer trust. To stay ahead, eco-startups need reliable hosting that aligns with their values. That’s where 4GoodHosting.com comes in—one of the top-rated Vancouver web hosting providers of 2025. Offering secure, sustainable, and Canadian-based hosting solutions, they help green entrepreneurs build their brand with confidence and conscience.
As eCommerce in Canada embraces localism and environmental responsibility, choosing a hosting provider that shares your vision is essential. 4GoodHosting goes beyond just hosting websites—they champion Canadian businesses, sustainable practices, and meaningful growth.
So go ahead—start that eco-friendly venture. With Vancouver web hosting from 4GoodHosting, your green business and your values are in perfect sync.
Reliable Vancouver Web Hosting with Local Servers & 24/7 Supportsteve198109
Looking for powerful and affordable web hosting in Vancouver? 4GoodHosting offers premium Canadian web hosting solutions designed specifically for individuals, startups, and businesses across British Columbia. With local data centers in Vancouver and Toronto, we ensure blazing-fast website speeds, superior uptime, and enhanced data privacy—all critical for your business success in today’s competitive digital landscape.
Our Vancouver web hosting plans are packed with value—starting as low as $2.95/month—and include secure cPanel management, free domain transfer, one-click WordPress installs, and robust email support with anti-spam protection. Whether you're hosting a personal blog, business website, or eCommerce store, our scalable cloud hosting packages are built to grow with you.
Enjoy enterprise-grade features like daily backups, DDoS protection, free SSL certificates, and unlimited bandwidth on select plans. Plus, our expert Canadian support team is available 24/7 to help you every step of the way.
At 4GoodHosting, we understand the needs of local Vancouver businesses. That’s why we focus on speed, security, and service—all hosted on Canadian soil. Start your online journey today with a reliable hosting partner trusted by thousands across Canada.
Top Vancouver Green Business Ideas for 2025 Powered by 4GoodHostingsteve198109
Vancouver in 2025 is more than scenic views, yoga studios, and oat milk lattes—it’s a thriving hub for eco-conscious entrepreneurs looking to make a real difference. If you’ve ever dreamed of launching a purpose-driven business, now is the time. Whether it’s urban mushroom farming, upcycled furniture sales, or vegan skincare sold online, your green idea deserves a strong digital foundation.
The 2025 Canadian eCommerce landscape is being shaped by trends like sustainability, local innovation, and consumer trust. To stay ahead, eco-startups need reliable hosting that aligns with their values. That’s where 4GoodHosting.com comes in—one of the top-rated Vancouver web hosting providers of 2025. Offering secure, sustainable, and Canadian-based hosting solutions, they help green entrepreneurs build their brand with confidence and conscience.
As eCommerce in Canada embraces localism and environmental responsibility, choosing a hosting provider that shares your vision is essential. 4GoodHosting goes beyond just hosting websites—they champion Canadian businesses, sustainable practices, and meaningful growth.
So go ahead—start that eco-friendly venture. With Vancouver web hosting from 4GoodHosting, your green business and your values are in perfect sync.
Understanding the Tor Network and Exploring the Deep Webnabilajabin35
While the Tor network, Dark Web, and Deep Web can seem mysterious and daunting, they are simply parts of the internet that prioritize privacy and anonymity. Using tools like Ahmia and onionland search, users can explore these hidden spaces responsibly and securely. It’s essential to understand the technology behind these networks, as well as the risks involved, to navigate them safely. Visit https://torgol.com/
Smart Mobile App Pitch Deck丨AI Travel App Presentation Templateyojeari421237
🚀 Smart Mobile App Pitch Deck – "Trip-A" | AI Travel App Presentation Template
This professional, visually engaging pitch deck is designed specifically for developers, startups, and tech students looking to present a smart travel mobile app concept with impact.
Whether you're building an AI-powered travel planner or showcasing a class project, Trip-A gives you the edge to impress investors, professors, or clients. Every slide is cleanly structured, fully editable, and tailored to highlight key aspects of a mobile travel app powered by artificial intelligence and real-time data.
💼 What’s Inside:
- Cover slide with sleek app UI preview
- AI/ML module implementation breakdown
- Key travel market trends analysis
- Competitor comparison slide
- Evaluation challenges & solutions
- Real-time data training model (AI/ML)
- “Live Demo” call-to-action slide
🎨 Why You'll Love It:
- Professional, modern layout with mobile app mockups
- Ideal for pitches, hackathons, university presentations, or MVP launches
- Easily customizable in PowerPoint or Google Slides
- High-resolution visuals and smooth gradients
📦 Format:
- PPTX / Google Slides compatible
- 16:9 widescreen
- Fully editable text, charts, and visuals
16. v
“Frequently bought with” Recommendations
Not necessarily
consumed together
Help customers shop for
complementary products
and try alternatives
Probably
consumed together
18. v
Learning from feedback
Traditionally collaborative filtering used explicit feedback to predict ratings
There may still bias in whether the user chooses to rate
Explicit Feedback Implicit Feedback
19. v
Learning from Explicit Feedback
• Explicit feedback may be more reliable but there is much less of it
• Less reliable if users rate based on aspirations instead of true preferences
vs
20. v
Implicit Feedback - trade-off quality and quantity
Strengthofevidence
Number of Events
21. v
Architecture
Event Data Score and
Select Top N
(Spark/EMR)
User/Product Factors
Event Data
Run-time
ranking for
diversity
Candidate
Selection
ALS
(Spark/EMR)
Generate
User-Product
Matrix
22. v
A Matrix Factorization Formulation for Implicit Feedback
N Products
MUsers
1
-
-
9
-
-
-
3
20
User Product Matrix
R; (M x N)
1
0
0
1
0
0
0
1
1binary
preferences
Preference Matrix R;
(M x N)
“Collaborative Filtering for Implicit Feedback” - Hu et. al
23. v
A Matrix Factorization Formulation for Implicit Feedback
~
Y
XT
Product Factors
(k x N)
User Factors
(M x k)
1
0
0
1
0
0
0
1
1
x
Preference Matrix R;
(M x N)
24. v
Matrix Factorization from Implicit Feedback - The Intuition
#Purchases Preference p Confidence c
0 0 Low
1 1 Low
>>1 1 High
• Confidence increases linearly with purchases r
• c = 1 + alpha * r
• alpha controls the marginal rate of learning from user purchases
• Key questions
• How should the unobserved events be treated
• How should one trade-off observed and the unobserved
25. v
Regularized Weighted Squared Loss
Confidence
User
Factors
Matrix
Product
Factors
Matrix
Preference
Matrix Regularization
Solve using Alternating Least Squares
27. v
Spark ALS Hyper-parameter Tuning
• rank k - diminishing returns after 150
• alpha - controls rate of learning from observed events
• iterations - ALS tends to converge within 5, seldom more than 10
• lambda - regularization parameter
29. v
Scoring user and products
With millions of products and users, scoring every (user, product) pair is prohibitive
Two goals in selecting products to score
• Long tail which have not been discovered
• Products that have an a priori high purchase rate (popular)
~
30. v
Trade-off popularity and discovery in the tail
We start with simple stratified sampling
For each user, score N products
Sample h products from Head
Sample t products from tail
N ~ 10000
h ~ 3000
t ~7000
31. v
Tuning Spark For ALS
Understanding Spark execution model and its implementation of ALS helps
• Training is communication heavy1
, set partitions <= #CPU cores
• Scoring is memory intensive
• Broad guidelines2
• Limit executor memory to 64GB
• 5 cores per executor
• Set executors based on data size
1 - http://apache-spark-user-list.1001560.n3.nabble.com/Error-No-space-left-on-device-tp9887p9896.html
2 - http://blog.cloudera.com/blog/2015/03/how-to-tune-your-apache-spark-jobs-part-1/
33. v
A/B Test Results
• Statistically significant increases
• Items per order
• GMV per order
• Total product sales spread over more
categories
34. v
Ok, we have a recommendation system
Where do we go from here?
35. v
What else do you do with user and product factors?
Score (user, product) pair on demand
Get Top N similar users
Get Top N similar product
As features in other models
36. v
Products similar to “Haigs Spicy Hummus"
More “Spicy Hummus”
Spicy Salsas
Generated using Approximate Nearest Neighbor
(“annoy” from Spotify)
37. v
What next
• Make recommendations more contextual
• Explain recommendations (“Because you did X”)
42. v
Traditional E-commerce
• Manage inventory in warehouses optimized for quick
fulfillment
• Customers only specify the “What”
• Disallow users from ordering out of stock products
• Set expectations
• “3 day shipping” but will ship in 10 business days
43. v
On-demand delivery from local retailers
• Shoppers navigate a complex environment where products
• may have run out
• may be misplaced
• may be damaged
• Customers specify “What”, “When” and “Where from”
• Improvise under uncertainty
44. v
Customers
Advertisers
(brands)
Stores
(Retailers)
lose revenue and
trust of customers
Everybody loses when out of stocks happen
• don’t get exactly what
they want
• must contemplate
and/or communicate
replacements
lose revenue and
trust of customers
• waste time searching for
items that aren’t in store
• context switch to
searching and
communicating
replacements
Shoppers
47. v
A probable solution
Do not show or allow customers to order items
that are currently out of stock
48. v
A probable (but terrible) solution
• Customers really know these stores
• “Missing” items is seen as a sign of an unreliable catalog/service
• May have been out of stock this morning but could be available when the
order is fulfilled
• Sets up negative spirals
“I was there over the
weekend. Please check behind
the cheeses aisle”
“Are you telling
me they don’t carry
strawberries?”
49. v
Solution that works reasonably well
• Shoppers can see Instacart recommended
replacements while shopping in the store
• Customers may also specify or choose from
recommended replacements
• Relatively more flexibility with groceries
• Some services offer to cancel the order if
an item isn’t available
50. v
Instacart Recommended Replacements
Flavor PackingSizeBrand Price
• Several product attributes matter
• Context matters, might benefit from personalization
• Must scale to millions of products
• Not always symmetric
• May be ok to replace X with gluten free X but not the other way around
Diet
Info
51. v
• Shoppers are trained to pick replacements
• But shoppers can benefit from algorithmic suggestions
• Many unfamiliar products in a vast catalog
• Validation for common products
• Finding replacements fast improves operational efficiency
Replacement Recommendations for Shoppers
52. v
• Customers can specify replacements while placing the order
• Can choose to communicate with the shopper in store to verify
Replacement Recommendations for Customers
53. v
What could we do if could predict item availability?
Customer
location
Nearest
store
Farther, but
better availability
Controlling for retailer and quality,
customer is indifferent to physical location
54. v
The Item Availability Prediction
Probability( Item in store | time, context)
What is probability that an item will be at
the store when the shopper shows up to
look for it?
55. v
Item Availability as a Classification Problem
TIMESTAMP, ITEM IDENTIFIED, IN STORE?
• Millions of examples from historical data
• Feature Engineering
• historical availability at multiple resolutions
• Eg: time since last “not found” event
• Item attributes
• Eg: perishables restocked differently than personal care
• Temporal Features
57. v
Serving and Optimization Layer
Fulfillment
Engine
Order
Fulfillment plan:
Store location, Shopper etc.
Items,
eligible store
locations
Availability
scores
Active in production with an acceptable
trade-off between
fulfillment efficiency and refund rate
58. v
Whats next
• Leverage model predictions for other features/data products
• Avoid negative feedback loops!
• Biased training data
• only have access to what is ordered through Instacart
• Tighter integrations with retailer data
• Scaling: continue to score a growing catalog at tight SLAs
61. v
Offline evaluation
• Ideally we want to evaluate user response to recommendations
• But we will only know this from an live A/B test
• Recall based metrics are an offline proxy (albeit not the best)
• Recall: “Fraction of purchased products covered among Top N
recommendations”
• We only use this for hyper parameter tuning
62. v
Ensembles
Use different types of evidence and/or product metadata to easily create ensembles
User x Products Purchased
User x Products Viewed
User x Brands Purchased
Model or Linear
Combination
…
64. v
Online ranking for diversity
“Diversity within sessions, Novelty across sessions”
“Establish trust in a fresh and comprehensive catalog”
“Less is more”
Cached list of
~1000 products
per user
Final list of
<100 products
promote diversity
65. v
Diversity
Top K products - ranked by score
Rank product categories by their median product score
> > >
66. v
Weighted sampling for diversity
Sample category in
proportion to score
Within category, sample in
proportion to product score
68. v
Out of stocks happen due to uncertainty in several places
Order fulfillment in (distant) future
Cannot hold inventory
Real-time inventory tracking across
thousands of locations isn’t perfect (yet)
Customer might reschedule delivery