Archive for data

You are browsing the archives of data.

All the Information in the World

1 exabyte = 1 billion gigabytes Kilobyte = 1,000 bytes or 10^3 (half a typewritten page) Megabyte = 1,000,000 bytes or 10^6 (a small novel) Gigabyte = 1,000,000,000 bytes or 10^9 (a pickup truck full of books) Terabyte = 1,000,000,000,000 bytes or 10^12 (1/10th of the Library of Congress) Petabyte = 1,000,000,000,000,000 bytes or 10^15 […]

An Evil Index for AI

The article below is not about Artificial Intelligence (AI). But it is about the ethics of algorithms that are likely to be used as building blocks in developing AI. Algorithms use decision trees to function and if the decision points embed a bias, the results are likely to be biased. If the subset of data […]

Freebase and Common Crawl

Freebase – [] What is Freebase? Freebase is an open, Creative Commons licensed repository of structured data of almost 20 million entities. An entity is a single person, place, or thing. Freebase connects entities together as a graph. Ways to use Freebase: * Use Freebase’s Ids to uniquely identify entities anywhere on the web * […]

Data Visualization

Data Visualization tools are listed here

Alter data

One way that an attacker might use to disrupt an organization could be to alter data. This could either be done overtly or covertly, depending on the purpose: Overt data alteration – this could be designed either to force a reaction by the defenders or to discredit either the data or the defender organization Covert […]