
Microsoft PM Engage Case IIM Shillong: Problem Definition

The document proposes integrating video recording and processing capabilities into the Microsoft Lens application. This would allow users to record educational videos, extract key information from them through computer vision and natural language processing, and organize the information across Microsoft 365 apps like OneNote, Word, and Excel. The feature aims to help students take better notes from lectures and field trips. Success would be measured by increased user engagement with Lens and higher retention rates after the new video feature is added.

Uploaded by

pushpak maggo

Microsoft PM Engage Case

IIM Shillong
Pushpak Maggo (2022PGP284)

Sukirat Singh Malhotra (2022PGP298)

Problem Definition
Our ideal consumer is anyone who uses video platforms to learn, i.e. video lectures, tutorials,
infographic videos etc. A video recording and processing feature needs to be incorporated into the
Microsoft Lens application. Various capabilities of the Microsoft 365 ecosystem, such as intelligent
character recognition and text-to-speech, need to be integrated into the development. The
video processing feature needs to be smoothly integrated with the already established image
processing in Microsoft Lens. The processing of the video should have an accuracy of at least 95 per
cent so that users face no hindrance while using the feature.

Customer value – The target persona, adults in the age group of 18-26, usually consumes a large
amount of visual content on various subjects for educational purposes. Making notes from
video lectures or taking screenshots is a cumbersome process and not an effective way to
record lectures systematically. Our new video integration will make the process much easier
by letting users use the video recording feature of their smartphones to archive information
and data points. The way of recording information has been shifting steadily from text
to images and, nowadays, to videos.

The main issues arise when useful information needs to be pulled out from the recorded video.
Sorting of video has become a cumbersome process and most users do it manually. This creates
duplication of efforts and results in lower productivity.

Business value – Microsoft, being at the forefront of software innovation, is always
working on improving the user-friendliness of its apps. Increasing the productivity of users
would benefit the brand considerably, and through the integration of M365 apps it would be able
to increase the overall productivity of the MS ecosystem and the user base of its applications. It can
further lead to an increase in revenues, as it has the potential to become a useful feature for other
enterprises.

Goals & Non-goals


We have discovered different scenarios which are as follows:

Recording video tutorials and offline classes.

Students: Students regularly face the problem of not revising the lectures they have attended, whether
online or offline. Teachers impart invaluable knowledge which at times cannot be noted down in full
by the students. They need a platform to record the lectures so that they can revisit them.

Administration of an institution: The administration staff need to keep a record of the lectures given
by the professors. They also need the recordings to ensure that the institution is imparting quality
education to the students.

Blind students – visually impaired students cannot read various resources shared in live online
classes or knowledge imparted via whiteboard in the offline based classes.

Video recordings during on-site visits for educational purposes:

Students: They regularly go on on-site visits to various factories and museums to get first-hand
knowledge of their subject matter. They record videos to capture the finer details of the place visited,
along with the knowledge imparted by the instructor. But they cannot go back to that video for
reference, as it is not well organized and ordering it into a structure would consume a lot of time.

Feature Definition
Students:

The students have different options to input video into the Lens application. These are as follows:

1. Import videos from the library or MS Stream.

2. There is also an option to directly record the video in the Lens app.

3. The Lens app can run in the background and record the lecture happening on Teams or any
other app via screen recording.

Once the video is uploaded to Lens, it will be split into individual pictures, i.e. its frames.
Suppose a video is shot at 30 fps: a 1-minute video will then contain 30 × 60 = 1800 frames. The AI
will then come into play and eliminate the almost identical frames, which would significantly reduce
the number of frames that Lens needs to evaluate.

Then, it will delete the frames that contain no data or only negligible data. After the frames are
cleansed, each remaining frame is analysed by the Lens app and the data is extracted in a systematic
manner into separate slides, worksheets or whatever platform the consumer chooses.
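
The two clean-up steps described above can be sketched roughly as follows. This is only an illustrative sketch, not how Lens works internally: it assumes each frame is available as a flat list of grayscale pixel values (0-255), that "almost identical" means a small average pixel difference, and that a frame "without data" is one where almost every pixel matches a plain background. The function names and the thresholds (diff_threshold, min_content, background, tolerance) are hypothetical.

```python
from typing import List, Sequence

# Assumption: a frame is a flattened grayscale image, pixel values 0-255.
Frame = Sequence[int]


def total_frames(fps: int, seconds: int) -> int:
    """Number of frames in a clip, e.g. 30 fps for 60 s gives 1800 frames."""
    return fps * seconds


def mean_abs_diff(a: Frame, b: Frame) -> float:
    """Average per-pixel absolute difference between two equally sized frames."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def content_ratio(frame: Frame, background: int = 255, tolerance: int = 16) -> float:
    """Fraction of pixels that differ noticeably from an assumed plain background."""
    marked = sum(1 for p in frame if abs(p - background) > tolerance)
    return marked / len(frame)


def select_key_frames(
    frames: Sequence[Frame],
    diff_threshold: float = 8.0,   # hypothetical: below this, frames count as "almost identical"
    min_content: float = 0.02,     # hypothetical: below this, a frame counts as "negligible data"
) -> List[int]:
    """Return the indices of the frames that survive both clean-up steps."""
    kept: List[int] = []
    last_distinct = None
    for index, frame in enumerate(frames):
        # Step 1: skip frames that barely differ from the previous distinct frame.
        if last_distinct is not None and mean_abs_diff(frame, last_distinct) < diff_threshold:
            continue
        last_distinct = frame
        # Step 2: skip frames that carry no (or negligible) content.
        if content_ratio(frame) >= min_content:
            kept.append(index)
    return kept
```

Only the frames returned by select_key_frames would need to be passed on to the comparatively expensive extraction step, which is where the reduction from 1800 frames per minute pays off.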

For the audio recorded in the video of the online class, the application would process
the input to transcribe the lecture, and the transcript would automatically be written to Word,
OneNote, or Excel by making use of Microsoft's speech-to-text feature.

The transcripts would be aligned with the slides made earlier. The student can read the
transcripts and refer to the slides side by side to better understand the whole lecture. This will
organize the video in an understandable format, which will lead to better comprehension of the whole
subject discussed in the class.
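
One simple way to picture this alignment is by timestamps. The sketch below assumes that every extracted slide carries the time (in seconds) at which it first appeared in the video, and that the speech-to-text step returns transcript segments with a start time. Both assumptions, and the function name align_transcript, are hypothetical and only serve as an illustration.

```python
from bisect import bisect_right
from typing import Dict, List, Tuple


def align_transcript(
    slide_times: List[float],
    segments: List[Tuple[float, str]],
) -> Dict[int, List[str]]:
    """Group transcript segments under the slide that was showing when each segment began.

    slide_times: first-appearance time of each slide in seconds, sorted ascending.
    segments: (start_time_in_seconds, text) pairs from the transcript.
    """
    if not slide_times:
        return {}
    aligned: Dict[int, List[str]] = {i: [] for i in range(len(slide_times))}
    for start, text in segments:
        # Latest slide whose first appearance is at or before the segment start.
        index = bisect_right(slide_times, start) - 1
        # Speech recorded before the first slide is attached to the first slide.
        index = max(index, 0)
        aligned[index].append(text)
    return aligned
```

With slides first seen at 0 s, 42.5 s and 90 s, a sentence spoken at 40 s is filed under the first slide and one spoken at 95 s under the third, so the student can read each slide together with what was said while it was shown.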

As for offline lectures, slides will be generated from the whiteboard, and the transcripts will be
synchronized with the board.

For on-site visits by the students, the Lens app can be very useful. They can record the whole visit
with the Lens app.

The details of the place visited will be captured, along with the audio recordings of the instructor.
Different slides will be made of the visit, and the transcripts of the instructor will be synchronized
with the slides. The frames will be ordered based on their timestamps, which will make the
documentation much better.

Blind students:

The input video can also be processed in a different manner to help visually impaired students. In
online lectures, various resources such as PowerPoint presentations and Word documents are
shown by the lecturers.

Via the Lens application, these resources would be captured and converted to rich text
format. That data can then be converted to speech via Microsoft's text-to-speech feature. This will be
extremely helpful for visually impaired students, as they can study those resources without
anyone's help.

Integration of M365 apps-

Importing- We will integrate MS Stream, which hosts live streams as well as on-demand videos.
Users can export videos from MS Stream directly to Lens, whether class lectures or corporate
training, and extract the notes, slides, numbers, tables or diagrams shown in them for use
for various purposes.

Exporting- The data extracted through Lens can be categorized as text, diagrams or
numbers and exported to the relevant MS platforms, i.e. Word for text-heavy
content, Excel for numbers etc., or it can be exported collectively to OneNote. All the data
extracted by Lens will also be uploaded to OneDrive, which ensures that the data is
available across all of the user's devices.
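
The export routing could look roughly like the sketch below. It is a minimal illustration under stated assumptions: the rule that a block counts as "numbers" when at least half of its tokens are numeric is invented for this example, diagrams are sent to OneNote only because the text above names no dedicated target for them, and classify_block and pick_destination are hypothetical names, not part of any Microsoft API.

```python
import re

# Assumption: a token such as "1,200", "-3.5" or "45%" counts as numeric.
NUMERIC_TOKEN = re.compile(r"[-+]?\d[\d,]*(\.\d+)?%?")

# Word and Excel follow the mapping in the text; the diagram target is an assumption.
DESTINATIONS = {"text": "Word", "numbers": "Excel", "diagram": "OneNote"}


def classify_block(text: str, is_image: bool = False) -> str:
    """Label an extracted block as 'text', 'numbers' or 'diagram'."""
    if is_image:
        return "diagram"
    tokens = text.split()
    if not tokens:
        return "text"
    numeric = sum(1 for token in tokens if NUMERIC_TOKEN.fullmatch(token))
    # Hypothetical rule: mostly numeric blocks are treated as tabular data.
    return "numbers" if numeric / len(tokens) >= 0.5 else "text"


def pick_destination(text: str, is_image: bool = False, everything_to_onenote: bool = False) -> str:
    """Choose the app an extracted block is exported to."""
    if everything_to_onenote:
        return "OneNote"
    return DESTINATIONS[classify_block(text, is_image)]
```

The everything_to_onenote switch mirrors the option of exporting all extracted data collectively to OneNote instead of splitting it across apps.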

Success Metrics
Feature Success Metrics
Number of User actions per session- Considering each frame in the video as a separate action we
could analyse the capabilities of our features with the number of actions taken by a unique user.

Number of Video Imports in the App- Video imports are used only by the newly introduced
feature; thus, analysing their number would indicate the success of our new integration.

Overall Success Metrics


Daily Active Users / Monthly Active Users ratio- An active user is someone who interacts with the
product. We can analyse the DAU and MAU before and after the launch of our video feature
to estimate the number of additional unique users attracted through
this feature.

Session Duration- Average session duration is the average time spent by a user on the Lens app,
comparing the average user time pre and post the integration of our video feature will help us
identify how our new feature is providing value to the consumers.

Retention Rate- Retention rate is used to calculate how many consumers our app can retain in each
time period.

Rating on App stores- Overall user-friendliness and relevance of the app could be analysed through
the rating and review of the app on Mobile App Stores.
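
For reference, the quantitative metrics above can be computed along the following lines. This is a sketch under assumptions: usage is assumed to be logged as (user id, date) pairs, the "month" is taken as a rolling 30-day window, and retention is taken as the share of a starting cohort that is still active in a later period. The function names are hypothetical.

```python
from datetime import date, timedelta
from typing import List, Sequence, Set, Tuple


def dau_mau_ratio(events: List[Tuple[str, date]], day: date, window_days: int = 30) -> float:
    """Unique users active on `day` divided by unique users active in the window ending on `day`."""
    window_start = day - timedelta(days=window_days - 1)
    daily = {user for user, when in events if when == day}
    monthly = {user for user, when in events if window_start <= when <= day}
    return len(daily) / len(monthly) if monthly else 0.0


def average_session_duration(durations_seconds: Sequence[float]) -> float:
    """Mean time spent per session, in seconds."""
    if not durations_seconds:
        return 0.0
    return sum(durations_seconds) / len(durations_seconds)


def retention_rate(cohort: Set[str], active_later: Set[str]) -> float:
    """Share of the starting cohort that is still active in the later period."""
    if not cohort:
        return 0.0
    return len(cohort & active_later) / len(cohort)
```

Running each function on data from before and after the launch of the video feature gives the pre/post comparison described above.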

Workflow
