Thu Jun 02 2022

Data Science NL meets Data & Drinks

Raamstraat 7-I, 1016XL Amsterdam & online

Talk 1: Product recommendation and the Dutch movie world
This talk will discuss two product recommendation approaches, one in a "hobby-project" restaurant setting, and the other in a more serious web/app transactions setting. We'll finish off with an insight into the Dutch movieworld by using some simple graph techniques.

About Longhow Lam: Longhow Lam is a freelance data scientist. He has worked on several projects at different companies. Among others: Customer Churn at Parkmobile, text mining complaints at SVB, predicting customer mortgage default at ING, customer clustering and insights at Pearle.

Talk 2: Machine learning with limited labels: How to get the most out of your domain expertise?
The success of a machine learning project often relies on the availability of good-quality labeled data. Machine learning models learn by seeing examples of the data, and both the number of examples and their quality make a difference in how well the model learns. Therefore, putting effort into obtaining a large number of examples with corresponding labels can be of big help in training your machine learning model. This, however, can be quite difficult in practice.

Samantha will speak about how we can use active learning and weak supervision to turn an unsupervised problem into a supervised one when labeling is difficult.

About Samantha Biegel: Samantha is a Machine Learning Engineer at Xomnia. She graduated cum laude in MSc. Artificial Intelligence and worked on numerous machine learning projects within the energy sector, financial sector and commercial sector. She enjoys working on making machine learning accessible to real-world problems and is passionate about developing trustworthy AI systems for a positive societal impact. These are some reasons why she is also currently working on an object detection pipeline for drone imagery to findrhino poachers in South Africa.

Talk 3: Video Game Programming for Data Science
In this talk, Jeroen will introduce Raylib, demonstrate how to use it from Python and R, and discuss its potential. Raylib is a C/C++ library for programming video games. This includes working with 2D & 3D OpenGL graphics, sounds & music, keyboard & mouse interactivity, and even gamepads & VR headsets. You can use Raylib in many languages, including Python, Julia, and JavaScript. And soon, you’ll be able to use all of Raylib’s functionality directly from R, thanks to a package called raylibr that I’m currently developing. I’m not expecting any Triple-A games, but I do believe that having such functionality is useful, especially for research, education, and simply having fun.

About Jeroen Janssens: Jeroen Janssens, PhD, is a data science consultant and certified instructor. His expertise lies in visualizing data, implementing machine learning models, and building solutions using Python, R, JavaScript, and Bash. He’s passionate about helping and teaching others to do such things. Since 2013, Jeroen runs Data Science Workshops, a training and coaching firm that organizes open enrollment workshops, in-company courses, inspiration sessions, hackathons, and meetups. Previously, he was an assistant professor at Jheronimus Academy of Data Science and a data scientist at Elsevier in Amsterdam and various startups in New York City. He is the author of Data Science at the Command Line (O’Reilly Media, 2021). Jeroen holds a PhD in machine learning from Tilburg University and an MSc in artificial intelligence from Maastricht University. He lives with his wife and two kids in Rotterdam, the Netherlands.

About the speakers

Jeroen Janssens

Educator

Longhow Lam

Freelance Data Scientist

Samantha Biegel

Machine Learning Engineer