• Martin Thoma
  • Home
  • Categories
  • Tags
  • Archives
  • Support me

Recent Posts

Data Applications

Data Applications

"Data is the new oil", "we need to be data driven", "we need to apply AI to keep being competitive" are some of the prashes I hear a lot. As I haven't seen yet a clear article pointing out what is done with the data ... here you are 🙂 Why it's … Read More »
Ways to store Data

Ways to store Data

This is an article I had for quite a while as a draft. As part of my yearly cleanup, I've published it without finishing it. It might not be finished or have other problems. Data is one core element of machine learning. Hence it is worth to think about ways … Read More »
Exploratory Data Analysis

Exploratory Data Analysis

Getting insights from data is exciting. So let's see how well I can cover this topic in a single article. In this article, I assume you have data in a single CSV file. If you have multiple CSV files, you can merge them similar to SQL JOIN statements. Prerequesites Python … Read More »
How to get Data for ML systems

How to get Data for ML systems

Machine Learning is only possible with data. The more data, the better. For many services this is a self-improving system. The more data the system gets, the better it becomes. The better the system is, the more users use it. The more users use the system, the more data the … Read More »
Google Ngram Viewer

Google Ngram Viewer

On October 14, 2010 Google announced that the number of scanned books is over 15 million. They did not simply scan those books, but they digitalized them. They can access not only image files, but the actual text. This allows Google to search in those books and to analyze the … Read More »
Data Visualization

Data Visualization

The United States public debt increased from \$10.7 trillion in 2008 to \$14.2 trillion by February 2011. Google processes about 24 petabytes of data per day. About 21.9 people live in Mumbai. Today it is incredibly easy to gain raw data, but without context they aren't worth … Read More »
  • Martin Thoma - A blog about Code, the Web and Cyberculture
  • E-mail subscription
  • RSS-Feed
  • Privacy/Datenschutzerklärung
  • Impressum
  • Powered by Pelican. Theme: Elegant by Talha Mansoor