Posts

Nov
21
How to create a Devcontainer for your Python project 🐳 (on Xebia.com ⧉)

How to create a Devcontainer for your Python project 🐳 (on Xebia.com ⧉)

Instead of giving other developers a setup document, let’s make sure we also create formal instructions so we can automatically set up the development environment. Devcontainers let us do exactly this.
10 min read
Oct
22
pyspark-bucketmap

pyspark-bucketmap

pyspark-bucketmap is a tiny module for pyspark which allows you to bucketize DataFrame rows and map their values easily.
2 min read
Jul
27
DropBlox: Coding Challenge at PyCon DE & PyData Berlin 2022 (on Xebia.com ⧉)

DropBlox: Coding Challenge at PyCon DE & PyData Berlin 2022 (on Xebia.com ⧉)

Conferences are great. You meet new people, you learn new things. But have you ever found yourself back in the hotel after a day at a conference, thinking what to do now? Or were you ever stuck in one session, wishing you had gone for that other one?
8 min read
Apr
18
From Linear Regression to Neural Networks

From Linear Regression to Neural Networks

How are linear regression, logistic regression and neural networks related? What is overfitting and how do we fight it? In this post, we find answers to these questions in an interactive way by working with a real-world dataset on penguins.
20 min read
Apr
09
Making Art with Generative Adversarial Networks

Making Art with Generative Adversarial Networks

Can computers make art? To find out, we tried ourselves. We used Generative Adversarial Networks to try to paint new Van Gogh paintings.
2 min read
Jan
17
Finding 'God' components in Apache Tika

Finding 'God' components in Apache Tika

How did big, bulky software components come into being? In this project, we explore the evolution of so-called God Components;
2 min read
Oct
29
Backdoors in Neural Networks

Backdoors in Neural Networks

In this project, we demonstrated how Neural Networks can be vulnerable to a Backdoor attack.
4 min read
Mar
01
COVID-19 Dashboard

COVID-19 Dashboard

What parts of the world are susceptible to Corona outbreak? We used Big Data and Data Engineering in this project to find out.
2 min read