Posts

Arkhn’s Data Platform Goes Agentic February 24, 2025

I have been the CTO of Arkhn for two years now, during this time we’ve been building the best data platform to empower healthcare providers, make better decisions and improve patient outcomes.

Object oriented programming deemed irrelevant February 20, 2025

I've been coding since 2006, during this time I've seen multiple trends & technologies emerge, rise and fall - nowadays the elephant in the room is the bad press around OOP languages the likes of Java, C#, C++.

From Pandas to Apache Spark’s Dataframe July 31, 2015

RDDs are the new bytecode of Apache Spark May 29, 2015

With the Apache Spark 1.3 release the Dataframe API for Spark SQL got introduced, for those of you who missed the big announcements, I'd recommend to read the article : Introducing Dataframes in Spark for Large Scale Data Science from the Databricks blog. Dataframes are very popular among data scientists, personally I've mainly been using them with the great Python library Pandas but there are many examples in R (originally) and Julia.

Changing Spark’s default java serialization to Kryo January 9, 2015

Apache Spark's default serialization relies on Java with the default readObject(...) and writeObject(...) methods for all Serializable classes. This is a very fine default behavior as long as you don't rely on it too much...

Try Apache Spark’s shell using Docker December 18, 2014

Ever wanted to try out Apache Spark without actually having to install anything ? Well if you've got Docker, I've got a christmas present for you, a Docker image you can pull to try and run Spark commands in the Spark shell REPL. The image has been pushed to the Docker Hub here and can be easily pulled using Docker.

So exactly what is this image, and how can I use it ?

Apache Spark : Memory management and Graceful degradation December 11, 2014

Many of the concepts of Apache Spark are pretty straightforward and easy to understand, however some lucky few can be badly misunderstood. One of the greatest misunderstanding of all is the fact that some still believe that “Spark is only relevant with datasets that can fit into memory, otherwise it will crash”.

This is an understanding mistake, Spark being easily associated as a “Hadoop using RAM more efficiently”, but it still is a mistake.

Apache Spark : l’importance du broadcast November 27, 2014

Dagger and Play 2 Java July 28, 2014

How to remove scaladoc generation from Play 2.2.x Production dist June 17, 2014

Timeoff 2014 @ Lateral Thoughts April 14, 2014

Highlighting field in memory-based Lucene indexes June 24, 2013

I'm using more and more Lucene these days, and getting in depth on a few subjects, today i'm going to talk to you about how to handle the new Highlighting features available with Lucene 4.1.

How to test and understand custom analyzers in Lucene June 20, 2013

I've began to work more and more with the great “low-level” library Apache Lucene created by Doug Cutting. For those of you that may not know, Lucene is the indexing and searching library used by great entreprise search servers like Apache Solr and Elasticsearch.

Book review : ElasticSearch Server by Rafal Kuc, Marek Rogozinski June 17, 2013

Elasticsearch is the way March 12, 2013

Don't get me wrong, i love Apache Solr, i think it's a wonderful project and the versions 4.x are definitely something you should check out when building a proper search engine.

Reste à ta place et fais ce qu’on te dit. February 1, 2013

Je ne suis pas le plus aguerri des vétérans, et je m'en rend compte encore assez souvent pour savoir que j'ai encore des sempaïs dans plus d'un domaine (pas que technique) dont certains avec qui j'ai la chance de travailler, même si ce n'est pas tout le temps au jour le jour.

Sharing PyPi/Maven dependency data January 31, 2013

As time is always running out, i don't think i'll have the time in a while to work again on the data I collected for the last three articles, Going offline with Maven, State of the Maven/Java dependency graph and State of the PyPi/Python dependency graph.

Going offline with Maven January 14, 2013

At Lateral-Thoughts, we organize at least once a year, what we call a “Timeoff” where we get together in a nice place and hack on what we want. It can be a learning period or a startup weekend-like event where we hack on a product/idea. Last time it was in a nice house in Guérande where we had everything we needed, internet access, rooms, tables, lots of space, an indoor swimming pool and a barbecue !

State of the Maven/Java dependency graph January 11, 2013

So here it comes, the second part of a three part articles on dependencies in different world, the first part was about Python/PyPi dependencies and considering the size of the graph : 20661 Nodes, 14047 Edges, I was able to show you the graph in an interactive javascript app using SigmaJS. But this times it's different, after extracting the metadata from Maven repositories, the raw data file generated weights 273M, and the size of the whole directed dependency graph is 186 384 Nodes and 1 229 083 Edges, in other words, it's going to be tough to show you the whole graph interactively but the raw data, the graph file and the Gephi file are available on the GitHub project.

State of the Python/PyPi dependency graph January 5, 2013

I usually work in Java/Maven environment, so when I explain to people that Python also has a package manager - a bit less heavy than maven - and that it's working pretty well, I always have to answer the same question : “Ok, but how does it solve the transitive dependency hell ?”

New Year’s Python Meme 2012 December 28, 2012

Agora : Automatiser la démocratie dans une NoSSII December 16, 2012

c'est l'histoire d'un concept, vieux comme le monde, la démocratie.

La vie de startup September 21, 2012

Pourquoi venir à la présentation “Architecture CQRS et performance avec Django” au #pyconfr September 14, 2012

Bon déjà, c’est facile, c’est moi qui la ferait donc ça sera forcement intéressant (au moins un peu !), ensuite plusieurs points.

On devrait toujours travailler comme ça #hackathon LT September 13, 2012

Voilà le #hackathonLT “été 2012” est fini pour moi, c’était bien sympa. Difficile d’arriver à expliquer à tout le monde l’ambiance ou l’organisation, vu que le but était de ne pas vraiment en avoir… mais pour faire simple imaginer ça :

J’y vais… j’y vais pas… July 20, 2012

Le Cloud n’est pas la martingale June 8, 2012

Je vois de plus en plus de devs et autre afficionados technolo-geek commencer à ne jurer que par le cloud, alors à défaut de présenter une vision partiale et totalitairement contre, ce que je ne suis pas, j'aimerais bien tempéré un peu ces ardeurs.

Snow leopard and Qt/PyQt 4.8.x won’t work January 17, 2012

Handle Celery-dependent tests in Django and with django-jenkins January 15, 2012

So in your life, one of these days, you're going to realize you need tests, and that “maybe” you also need to test components that depend on several Celery tasks.

La puissance et le contrôle September 24, 2011

En développement, comme dans beaucoup d'arts martiaux, on peut devenir fort assez rapidement. On peut se fixer des objectifs (une ceinture, une victoire / maitriser une technologie ou réaliser un projet perso) et les atteindre rapidement selon le language, le maître et l'implication qu'on y met.

IncidentsRATP – IncidentsTransports est publié June 8, 2011

Voilà c’est fait, après maintes péripéties et une attente longue je l’avoue, le projet est enfin publié en AGPL v3.

Etat de fait et fatalité May 31, 2011

Un état de fait m'énerve, il m'énerve d'une part parce que dans d'autres domaines professionnels, il n'est pas aussi exacerbé, et d'autre part parce que je pense que la tendance s'accélère.

Should i really learn Java ? May 1, 2011

So i’ve been a professional Java developper for a few years now, and the question seems interesting to me. Should, let’s say a student, really learn Java even if he does not want to do “Enterprise applications” ?

Short answer, Yes.

TOR et HADOPI March 28, 2011

Juste pour faire un petit point autour de la HADOPI et plus précisément de la position de la HADOPI envers les proxy, et plus spécifiquement de la distribution des proxy comme le fait TOR.

How to be a happy programmer (with Python) ? 2/3 March 25, 2011

In the series of the Python “features” that makes me happy last time i began with two concepts, the with statement and the list comprehensions, now i'm going to talk about Multiple assignments and the import aliases.

Using TOR with Python March 25, 2011

There are many occasion where you may be limited using your own IP address, i will obviously only refer myself to “rightful” cases where you need to use different IP address in very short lapse of time. Let's say you want to test your website localization functionality, or just access it using many different IP address and see how the system deals with it.

How to be a happy programmer (with Python) ? 1/3 March 18, 2011

I've just watched Hillary Mason's talk in Pycon 2011 : http://pycon.blip.tv/file/4878710/
And that got me thinking about all the python constructs that makes my day better, and i decided to make a list of them and their meaning.

How to debug Django using the Python Debugger PDB March 15, 2011

Even if that seems common sense, i found out that there's not that much sources that explains how to use PDB with Django's bundle webserver. So here we go, let's say you have some treatment like that :

[IncidentsRATP] Suite et bientôt fin March 10, 2011

J'avais déjà publié à la fois la lettre de la RATP me mettant en demeure de fermer le site incidents-ratp.com et mon ressenti sur toute cette affaire, mais là l'histoire continue.

IncidentsRATP – Application IPhone February 27, 2011

On peut dire beaucoup de choses, mais pas que l'on chôme. Une application IPhone développée par Benoît Clouet est en préparation et bientôt prête pour être testée. Nous avons beaucoup travaillé ensemble sur la communication avec l'API du site, et voici un petit screencast de présentation de la version de test actuelle (chargement un peu long) :

Quelques chiffres – Revue de presse February 25, 2011

J'aime beaucoup les chiffres, mais surtout les chiffres bien utilisés. J'ai eu très peu de temps à consacrer à ce blog ces derniers jours, et beaucoup de temps pris par les interviews et la communication de manière générale. C'est nouveau pour moi dans le sens où pour une fois mes propos sont très largement et systématiquement déformés dans beaucoup des medias. Je ne blame personne et le message que je tenais à faire passer est “globalement” bien ressorti, donc je ne vais pas me plaindre.

IncidentsRATP – La lettre February 20, 2011

La, maintenant célèbre, lettre de la RATP que j'ai reçu me demandant d'arrêter toute activité sur incidents-ratp.com. J'ai bien sûr retiré toutes références aux personnes et adresses y compris le personnel de la RATP.

IncidentsRATP – pré Post-mortem February 20, 2011

Depuis quelques mois je travaillais sur un projet que j'avais intitulé incidents-ratp.com avec un but simple : mettre à disposition une plateforme libre et ouverte pour connaître les incidents en temps réels dans les transports en commun.

FOSDEM 2011 : Day #2 February 12, 2011

Principal talk de la journée - Facebook messages avec HDFS/HBase :
Une présentation du projet de Facebook d'unifié tous les messages (SMS, Email, Chat...).

FOSDEM 2011 : Day #1 February 6, 2011

La journée commence bien, petit déjeuner à l'hôtel, et voyage vers le FOSDEM. Petite grasse matinée oblige, on loupe les premières conférences sur la liberté des logiciels et la liberté tout court pour arrivé dans un amphi gigantesque, mais surtout plein à rebord.

FOSDEM 2011 – Day 0 February 5, 2011

Vendredi soir, après une longue journée de boulot, et un petit voyage en train d'1h30, rendez-vous Bruxelles pour une soirée sympa au beer event du FOSDEM 2011.

FOSDEM 2011 February 3, 2011

Le FOSDEM 2011 (Free OpenSource Developer European Meeting) aura lieu ce weekend du 5 février à Bruxelles !

Agilité et BuildWall December 1, 2010

Recharger son .profile sous Linux/Mac November 30, 2010

Etat de l’art de la GED OpenSource November 16, 2010

Data Extraction without semantic web November 2, 2010

Moving On… October 29, 2010

Ca y est ! Fin de mission aujourd'hui.

A good oldie : 1996 Java Vs Python October 28, 2010

Un hébergement qui vaut le détour : AlwaysData October 27, 2010

Ok, ce n'est très certainement pas mon domaine d'expertise, mais je suis un programmeur Python, principalement sur Django et je cherche souvent un hébergement.

Combien d’applications lourdes utilisez-vous ? October 26, 2010

Fin du StartupWeekend Paris 10/2010 October 12, 2010

Ca y est l'édition d'octobre 2010 du Startup Weekend de Paris qui c'est tenu à Télécom Paris est terminée. C'était ma première fois, et comme le promettait l'affiche, c'est clair que je reviendrais.

Ce que je veux apprendre October 6, 2010

OpenOffice.org is officially forked : LibreOffice 3 September 28, 2010

I honestly have been waiting for this since the first day i met with Thorsten Behrens and worked with him and Eric Bachard on improving OOo Impress.

Here’s what happens when agile methods are whispered September 22, 2010

Citation September 21, 2010

[Django] Append objects in request.session September 17, 2010

This article is once again more of a reminder to me, i hope it will help everyone at the same time.

De l’intégration des nouveaux arrivants September 16, 2010

De la différence entre innovations September 15, 2010

Citation September 14, 2010

Paris Startup Weekend 8-10 Octobre 2010 September 13, 2010

Weekend… September 12, 2010

Citation September 10, 2010

Comment créer des sous-formulaires à partir d’un Model Django September 9, 2010

Evitons Javascript September 8, 2010

Autodidactes en informatique September 7, 2010

S'il y a bien une constante que je retrouve à travers toutes mes expériences professionnelles, c'est qu'on vient bien de partout pour finir dans l'informatique. Je m'explique :

Création de modèles avec PySide/PyQt September 5, 2010

Ça y est ! mon dernier article sur développez.com est terminé, relu, vérifié, contre-vérifié etc.

Introduction et prise en main de PyQt/PySide August 26, 2010

MultiThreaded Test cases – Why ? How ? and What’s for dinner ? August 13, 2010

Okay so to make a small introduction (i promise a tiny one) i’m starting to contribute into VirgoRT (ex-SpringSource DM Server) for me it’s an opportunity to work with great people, learn more about OSGi and work on a real JEE application server (and more).

Building Virgo on MacOsX Leopard July 28, 2010

I won’t say that my contribution to this work will be tremendous, but it’s nice to know a little more if you want to try out building the ex SpringSource DM Server

Blog au(tour) du monde April 6, 2010

It’s like that – always the communists’ way March 23, 2010

Machine-learning empowerment – Apache Mahoot March 23, 2010

Maybe we’ll soon be able to get more out of machine learning algorithms, Google is communicating a lot, throught the GSOC (Google Summer of Code) and Google Code as a whole, on this project and supporting the Apache foundation.

Self-Improvement : Refactor my code !! March 16, 2010

Don’t forget to subscribe ! March 15, 2010

Self-Improvement : [Java] – Weak references (Soft/Phantom) March 15, 2010

That’s official – Be Stupid March 10, 2010

Self-Improvement : [Java] – ClassLoading in depth February 28, 2010

That’s when your dream comes true… January 9, 2010

Sur le site d’une grande banque… December 9, 2009

voilà ce qu’on pourrait trouver, alors qu’on essaye sans succès sur une page Web de changer son mot de passe :

One more down, many to go… December 5, 2009

After the court decision about thePirateBay, now it’s the time to be sad for Mininova,

Just because i haven’t done anything lately… December 1, 2009

and because i’m deep into the last parts of a new release (for my job), here’s a little thought i’d love to share :

Metasploit is Rising November 23, 2009

Petite pause japonaise November 22, 2009

Le paradoxe de l’analyste programmeur ? November 18, 2009

Useful Tips : StringBuilder and Java String Concatenation November 14, 2009

Enfin la réponse à l’ultime question… November 11, 2009

Internet comme substitut aux relations humaines November 8, 2009

Unfinished business… and decision making November 2, 2009

Design Pattern : Proxy October 30, 2009

Advanced use of Eclipse for Java October 27, 2009

Barry Schwartz : On the paradox of choice October 24, 2009

Une leçon pour Alain Finkielkraut sur Internet October 21, 2009

Did you know ? October 19, 2009

Looking for thesis and inspired science ? October 16, 2009

Google.com is a …. spell checker !? October 13, 2009

To discover : School Food Punishment October 8, 2009

Better web applications October 2, 2009

NoLife-TV : Y’a pas que la vraie vie dans la vie September 29, 2009

If architects had to work like software developers September 26, 2009

New moto ? : Be Nice or Leave September 23, 2009

Design Pattern : Singleton September 6, 2009

A découvrir Tokyo Magnitude 8.0 September 5, 2009

Expression régulières Vs State Machines September 5, 2009

Précis de SQL : utilisation de “Minus” September 5, 2009

Alors qu’il fait chaud… August 22, 2009

Rebuild of Evangelion dévoile son deuxième épisode July 18, 2009

Modélisation de la C.A.F par des Threads asynchrones July 17, 2009

Cheat sheet for vim/gVim July 9, 2009

RMLL’09, OpenOffice.org, LOPSI/HADOPI et le futur July 9, 2009

English report of the OpenOffice.org project : The Eraser July 2, 2009

Useful tips : Insert a clob of more than 4k into database June 25, 2009

RMLL 2009 – OpenOffice.org and OpenSource lectures June 23, 2009

Introduction aux produits structurés June 22, 2009

Java Web start/JNLP and file access from within June 22, 2009

New blog and Macbook Air review after 5 months June 21, 2009

Useful Tips : How to create a thread that must get a result with a timeout time June 9, 2009

Les élections européennes ne passionnent pas la télévision, peu importe June 7, 2009

Useful Tips : How to insert BLOB into an Oracle database using JDBC June 5, 2009

Useful Tips : How to find the biggest files on a unix system ? June 3, 2009

Args4j ou de l’utilisation de Java pour faire des Batchs June 2, 2009

Déformation professionnelles en Informatique comme ailleurs May 28, 2009

La déformation professionnelle est un concept, il faut bien l’avoué, typiquement français qui consiste à dire qu’un professionnel verra toujours sa vision du monde influencée par le type de travail qu’il réalise.

Langage de programmation : Comme si les performances comptaient … May 13, 2009

Comme j’anticipe le gros troll qui pourrait découler d’un tel titre, je vais préciser ma pensée : Je ne veux pas dire qu’il ne faut pas optimiser son code autant que faire se peut et qu’il n’est pas nécessaire de toujours réfléchir au plus performant.

Lire un Dvd sous Ubuntu – Cas d’école de piratage May 3, 2009

Petit tour à Luxembourg Ville – Canards et paysages April 25, 2009

La fusion Oracle – Sun signera-t-elle la fin des conflits entre Eclipse et Netbeans ? April 23, 2009

Introduction au langage Natural et problematiques de migration vers Java April 23, 2009

Petit précis de Luxembourgeois April 22, 2009

Symfony, sfPropelPager et passage de variables March 21, 2009

Petits bugs entre amis … March 17, 2009

Comment detruire Skynet ?? March 10, 2009

TCP/IP – Rudiments d’analyse de paquets et d’intrusions March 9, 2009

Validateur de code SIREN pour symfony March 4, 2009

Faire un “Select DISTINCT …” sous Symfony avec Propel March 3, 2009

Finding a way to compile OpenOffice.org February 26, 2009