Understanding what the user means + mapping that to our inventory
Top tips for running a data science journal club
At Lyst we’ve been improving our testing environments over the last year or so, and one of the main elements we wanted to improve was our testing stack with Selenium. We’ve used Selenium in the past, but the tests grew old, were poorly maintained, and few people could work out how they worked after our shift to Docker (read more about that in a previous post.)
Unless you've been under a rock in the Twitter world for the last week - you will have seen the #ILookLikeAnEngineer hashtag. Here at Lyst, we have some brilliant engineers - many of whom are women. We decided we should tell you all a little bit more about ourselves, how we came to engineering, and what advice we have for women wanting to be engineers themselves.
Nearly half of the staff at Lyst are technical or have a technical background. We have a large technology stack and plenty of exciting projects that we’re working on. But we’re often so focused on developing great experiences that we don’t get the time to share what we’re doing with you.
Our engineering team is taking a short hop to mainland Europe this July to attend EuroPython 2015 in sunny Bilbao, Spain. We’ll be spending six days with fellow Pythonistas from all across Europe (and even the world!) and attending over 200 sessions, workshops, and social events.
Nearest neighbour search is a common task: given a query object represented as a point in some (often high-dimensional) space, we want to find other objects in that space that lie close to it. For example, a mapping application will perform a nearest neighbours search when we ask it for restaurants close to our location.
Bayesian analysis of A/B tests is a great way of getting reliable inference. Except, of course, when we get our priors horribly wrong.
We process millions of fashion products a day from over 500 retailers. One of the goals of the data-team is to transform this stream of semi-structured data into one consistent product catalogue. Colour is one of the most difficult fields to normalise. In this post we discuss how product colors are derived from product images.
We process millions of images using an ecosystem of classifiers. In order to get the most information out of an image, it is best to remove the background as it may contain data which will make the classifier less accurate. In this post we discuss methods of removing backgrounds from images.