aboutsummaryrefslogtreecommitdiff
path: root/research/data.mdwn
blob: c25d8f463f65fbb46f6a1fcab176b25753b1a5be (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
[[!meta title="Data science, lean databases and formats"]]

## Basic

* Ontologies and how to deal with lists.
* Standards: schema.org, microdata, microformats, json, yaml, csv, dot, vcard.
* Intelligence: how to easilly search, index and produce outputs with strutured data?
* Samples: TODO and ChangeLog (see [yankee: Changelogs meet YAML](https://github.com/studio-b12/yankee)).

## Software

* [mtail](https://packages.debian.org/stable/mtail).
* [Scrapy | A Fast and Powerful Scraping and Web Crawling Framework](https://scrapy.org/).
* [phantomjs in stretch](https://packages.debian.org/stable/phantomjs).
* [wpull](https://wpull.readthedocs.io/en/master/usage.html).
* [Darktable - virtual lighttable and darkroom for photographers](https://packages.debian.org/stable/darktable).
* OsmAnd and GPX tracks.

## API, bigdata, etc

* https://stripe.com/blog/idempotency
* https://botman.io
* https://github.com/metabase/metabase
* [Apache Drill](https://drill.apache.org/), [presto](https://github.com/prestodb/presto), hadoop, etc.
* [TensorFlow](https://www.tensorflow.org/).
* [Wikidata](https://www.wikidata.org).
* [Swagger Specification](http://swagger.io/specification/).