Posts in category

Data


Hijacking IP addresses is an increasingly popular form of cyber-attack. This is done for a range of reasons, from sending spam and malware to stealing Bitcoin. It’s estimated that in 2017 alone, routing incidents such as IP hijacks affected more than 10 percent of all the world’s routing domains. There have been major incidents at Amazon and Google and even in nation-states — a study last year suggested …

The late data visionary Hans Rosling mesmerised the world with his work, contributing to a more informed society. Rosling used global health data to paint a stunning picture of how our world is a better place now than it was in the past, bringing hope through data. Now more than ever, data are collected from …

You often hear Type I and Type II errors in statistics classes. There is good reason for that — minimizing either of these two errors is pretty much the core of statistical theory. Preliminaries Type I and Type II errors are related to the concept of hypothesis testing. In hypothesis testing, we have two hypotheses: …

Overview This guide shows how to install Pandas. Pandas is an open source library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language   Prerequisites Python has been installed Installation on Ubuntu Installation on Windows Optional but recommended. Setup a VirtualEnvironment and Pip has been installed. VirtualEnvironment for Ubuntu   …

From Jeopardy winners and Go masters to infamous advertising-related racial profiling, it would seem we have entered an era in which artificial intelligence developments are rapidly accelerating. But a fully sentient being whose electronic “brain” can fully engage in complex cognitive tasks using fair moral judgement remains, for now, beyond our capabilities. Unfortunately, current developments …

Clickbaits are made to lure you in. They are so attractive and yet usually misleading. This kind of headlines has been extensively exploited to the point that bots are being used to generate headlines. From the standpoint of readers who want to get information or moderators who want to uphold the integrity of their site’s …

With the huge torrent of content coming to life in social media each second, moderation is quite the challenge. Right now, there are means to automate this process using artificial intelligence (AI) with humans manually screening the content. Makes you wonder if there will ever be a time where humans will be completely out of …

Outliers in data are the weird ones in a set. Their values are way off the rest of the values of the sample. They can really ruin your analysis, especially if you are using methods which are sensitive to the presence of outliers. Given this, a lot are inclined to remove these observations. While this …

Microsoft Build 2019 | Azure Notebooks for Data Science Developers Session ID: CFS2004   Have developer skills but want to scale up to data science in Azure? Azure Notebooks makes it easy to move from developer to including data science skills in your skill set. Microsoft provides an Azure hosted Jupyter Notebook solution with Azure …

MIT system “learns” how to optimally allocate workloads across thousands of servers to cut costs, save energy. A novel system developed by MIT researchers automatically “learns” how to schedule data-processing operations across thousands of servers — a task traditionally reserved for imprecise, human-designed algorithms. Doing so could help today’s power-hungry data centers run far more …