How to Build, Train and Deploy Your Own Recommender System â Part 2
We build a recommender system from the ground up with matrix factorization for implicit feedback systems. We then deploy the model to production in AWS.
I still remember the moment that piqued my interest in Data Science. It was probably a couple of years ago, I saw this segment in a popular HBO sitcom - Silicon Valley, and I wasnât even following the series. I donât remember, but for some reason I found myself watching this on Youtube. So hilarious when Jian Yang first demoed the Not Hotdog app. But when I discovered that it was actually a real app that they developed for the series, I couldnât resist, but I had to find out exactly how they made it.
This week, we spent more time getting deeper experience with Pandas, how data scientists use it to slice and dice data and effectively use it for exploratory data analysis. As we get to use it more, we get the appreciation of how indespensible it is at the stage of this data science end to end process. And one can undestand why data scientists love using Jupyter notebooks at this stage in the process too.
The course instructor always talks about that in data science, one needs to build this intuition, of being able to find a problem that is worth solving where itâs solution has an impact as well as identify if the data we have available is of good quality. And that we can have all the volume of data we want, and if it is no good, then they still belong in the rubbish bin. My goal, by the end of this course, is to not only complete the Capstone project, but more importantly, to be able to understand at least how to achieve that intuition that he keeps on talking about.
With the Capstone Project proposal due at the end of next week, Iâve been thinking about different options, inspecting several available public datasets, and researching problems that people (myself included) are experiencing that can be solved with data science. Because my data science intuition needs some improvement, itâs also worth noting that I need to come up with a few ideas, since not all are good ideas, or are problems that are able to be completed with the limited time available to me by the end of the bootcamp.
The following are the possibilities:
Iâve been interested with pricing related Amazon data since I looked into Amazon Fulfilment by Amazon (FBA) a while back. There are several problems that 3rd party sellers would want answers for such as:
Australian house prices are notorious worldwide for being overpriced and unreachable for many. There is a public data available from Australian Bureau of Statistics that show historical property price index for different states from mid 2000 up to the present. From this information, in combination with data from other datesets, we want to:
Ever since the first season of Drive to Survive, Iâve been captivated by the drama and excitement that is Formula 1. Iâve been consuming this public API in some of my past blog posts (DynamoDB and Single-Table Design, Simple GraphQL consumer with Apollo Client) and I thought it was fitting to continue this trend and explore the instights that can be gleaned from it:
As I have been dabbling in marathons and triathlons, on and off through the years, this is also one my interests. For years I have been wondering:
I will be submitting my Capstone Project proposal at the end of next week, and the ideas I have presented above, in one form or another will most probably be it!
We build a recommender system from the ground up with matrix factorization for implicit feedback systems. We then deploy the model to production in AWS.
We build a recommender system from the ground up with matrix factorization for implicit feedback systems. We put it all together with Metaflow and used Comet...
Building and maintaining a recommender system that is tuned to your businessâ products or services can take great effort. The good news is that AWS can do th...
Provided in 6 weekly installments, we will cover current and relevant topics relating to ethics in data
Get your ML application to production quicker with Amazon Rekognition and AWS Amplify
(Re)Learning how to create conceptual models when building software
A scalable (and cost-effective) strategy to transition your Machine Learning project from prototype to production
An Approach to Effective and Scalable MLOps when youâre not a Giant like Google
Day 2 summary - AI/ML edition
Day 1 summary - AI/ML edition
What is Module Federation and why itâs perfect for building your Micro-frontend project
What you always wanted to know about Monorepos but were too afraid to ask
Using Github Actions as a practical (and Free*) MLOps Workflow tool for your Data Pipeline. This completes the Data Science Bootcamp Series
Final week of the General Assembly Data Science bootcamp, and the Capstone Project has been completed!
Fifth and Sixth week, and we are now working with Machine Learning algorithms and a Capstone Project update
Fourth week into the GA Data Science bootcamp, and we find out why we have to do data visualizations at all
On the third week of the GA Data Science bootcamp, we explore ideas for the Capstone Project
We explore Exploratory Data Analysis in Pandas and start thinking about the course Capstone Project
Follow along as I go through General Assemblyâs 10-week Data Science Bootcamp
Updating Context will re-render context consumers, only in this example, it doesnât
Static Site Generation, Server Side Render or Client Side Render, whatâs the difference?
How to ace your Core Web Vitals without breaking the bank, hint, its FREE! With Netlify, Github and GatsbyJS.
Follow along as I implement DynamoDB Single-Table Design - find out the tools and methods I use to make the process easier, and finally the light-bulb moment...
Use DynamoDB as it was intended, now!
A GraphQL web client in ReactJS and Apollo
From source to cloud using Serverless and Github Actions
How GraphQL promotes thoughtful software development practices
Why you might not need external state management libraries anymore
My thoughts on the AWS Certified Developer - Associate Exam, is it worth the effort?
Running Lighthouse on this blog to identify opportunities for improvement
Use the power of influence to move people even without a title
Real world case studies on effects of improving website performance
Speeding up your site is easy if you know what to focus on. Follow along as I explore the performance optimization maze, and find 3 awesome tips inside (plus...
Tools for identifying performance gaps and formulating your performance budget
Why web performance matters and what that means to your bottom line
How to easily clear your Redis cache remotely from a Windows machine with Powershell
Trials with Docker and Umbraco for building a portable development environment, plus find 4 handy tips inside!
How to create a low cost, highly available CDN solution for your image handling needs in no time at all.
What is the BFF pattern and why you need it.