Le Blog

Les collaborateurs d’Invivoo explorent et partagent leurs connaissances du domaines IT.

Explore and analyze your data with apache zeppelin – part 2

Welcome back to our second part about Apache Zeppelin. In ‘EXPLORE & ANALYSE YOUR DATA WITH APACHE ZEPPELIN - Part 1’ our previous post, we introduced Apache Zeppelin as one of the best Big Data tools to your Data Analytics use cases and shared details about...

Explore & Analyse your data with Apache Zeppelin

In order to gain insight and benefit from the growing amount of data generated both internally and externally by their sources and systems, companies need strong and reliable tools to reach that objective. Apache Zeppelin — an open-source data analytics and...

Creating your first Apache Airflow DAG

Throughout the past few years, Apache Airflow has established itself as the go-to data workflow management tool within any modern tech ecosystem. One of the main reasons for which Airflow rapidly became this popular is its simplicity and how easy it is to get it up...

Apache Airflow: What is it and why you should start using it

In this data-driven era, the number of open-source Big Data technologies rose exponentially in a matter of a few years. Because of this multitude of options, it results in the introduction of a vast range of patterns and architectures to store, process, and visualize...

Structured Streaming in Spark

Streaming processing is a set of techniques used to extract information from unbounded data (a type of dataset theoretically infinite in size) Some examples of streaming are device monitoring, fault detection, billing...

Discovering recommendation systems

WHAT ARE RECOMMENDATION SYSTEMS ? We all wonder how Amazon or Netflix came to such "power" and success? How can Netflix know about our movie preferences? How did Amazon know the unconditional Games of Thrones’ fan that I am, that I love The North Face and Geography?...

Résoudre vos problèmes d’encodage Unicode

INTRODUCTION L'encodage des caractères est quelque chose de transparent pour la plupart des utilisateurs. C'est même devenu tellement transparent avec l'avènement de l'astucieux format Unicode UTF-8 que même un programmeur peut se retrouver un peu perdu lorsqu'une...

Recommendation engine : from collective to personalized

The recommendation engine is at the heart of the business strategy of all e-commerce giants. For example, 35 percent of Amazon's e-commerce revenue is generated by its referral engine, according to a McKinsey study. We see every day the carousels of products that we...

Moteur de recommandation : du collectif au personnalisé

Le moteur de recommandation est au cœur de la stratégie commerciale de tous les géants du e-commerce. Pour preuve, d'après une étude de McKinsey, 35 % des revenus de la branche e-commerce d’Amazon sont générés par son moteur de recommandation. Nous voyons tous les...

A la découverte des systèmes de recommandation

C'est quoi un système de recommandation ? On se demande tous comment Amazon ou Netflix sont arrivés à ce "power" et ce succès ? Comment Netflix peut connaître nos préférences cinématographiques ? Comment Amazon a su que je suis un fan inconditionnel de Games of...

Kafka: the Big Data streaming platform

In modern information systems, we are confronted with ever-increasing volumes of data requiring to be processed in real time. However, the point-to-point connections commonly used do not allow easy loading scalability. Data producing services have a strong link with...

Notebooks are The Missing Piece of the Big Data Revolution

More than a decade ago, what is now commonly known as the Big Data era started with the emergence of Hadoop. Since then, a multitude of technologies were introduced to fulfill multiple tasks within the Hadoop ecosystem, with capabilities ranging from processing data...

How Big Data can contribute in reducing Banks’ churn rate?

How Big Data can contribute in reducing Banks’ churn rate?

According to a study conducted by Efma, one in two customers is willing to change banks in the next six months. The reason being? The lack of personalized products and services. At a time when competition between banks is raging, it is essential that they change their...

Kafka : la plateforme de streaming pour le Big Data

Au sein des systèmes d’informations modernes, nous sommes confrontés à des volumes de données à traiter en temps réel toujours plus important. Cependant les connections point à point couramment utilisées ne permettent pas de monter en charge facilement. Les services...

Patterns de streaming

Patterns de streaming

Nous aborderons dans cet article les différents patterns que les développeurs, avec leur responsabilité dans l’intégration de données, peuvent être amenés à mettre en œuvre dans le domaine du streaming. Mais tout d’abord présentons la plateforme qui a pour ambition de...

Monitoring and detection of anomalies with ELK

Monitoring and detection of anomalies with ELK

MEASURING PERFORMANCE INDICATORS WITH ELK Monitoring and measuring IT applications’ performance indicators are a major challenge for companies. The evolution of technologies around qualification, storage and processing big data as well as machine learning has made it...

5 pre-requisites before launching a Big Data project

A BIG DATA PROJECT IS FULL OF PITFALLS Many pitfalls! Who are not only on IT! During my visit to the Big Data Salon, held in Paris on March 6 and 7, 2017, I was able to attend to 14 feedback, informative and operational at a time. I propose a summary of this visit...