Author Archive

Data is never clean. You will spend most of your time cleaning and preparing data. 95% of tasks do not require deep learning ( or other forms of machine learning). In 90% of cases generalized linear regression will do the trick. Big Data is just a tool. You should embrace Bayesian approach. No one cares how you did it. Academia and business are two different worlds. Presentation is key – be the master of Powerpoint. All models are false, but some are useful. There is no fully automated Data Science. You need to get your hands dirty.

Thank you to Fabien of Power Data Group for endorsing the ideas and vocalising the need of managed data in an increasing connect world with enterprises hungry for insights. You can have all the tools, and systems, but it takes careful planning, consultants and data to get insights and reports flowing.

Glitchdata welcomes Power Data Group as a distributor of our products.

 

Hi! It’s been a busy last couple of weeks, with trips to asia, a slew of conversation with friends and honing of concepts around Glitchdata. However, it is done and I am pleased to present “Glitchdata“.

Until now, Glitchdata has been a community of data architects focused on data integration. We all breath data, and over the course of work it is becoming increasingly apparent that the main problem faced by the data industry is data itself. As such, we are now focused on master data management and the distribution of managed datasets.

So watch this space for news, insights and ideas on this newly minted startup. Feel free to give us feedback, and support us via donations or purchasing our datasets.