aboutsummaryrefslogtreecommitdiff
path: root/research/data.mdwn
blob: 777661fbc2130333559d2b7fb9f4e2e7cb1ed61f (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
[[!meta title="Data science, lean databases and formats"]]

## Basic

* Ontologies and how to deal with lists.
* Standards: schema.org, microdata, microformats, json, yaml, csv, dot, vcard.
* Intelligence: how to easilly search, index and produce outputs with strutured data?
* Samples: TODO and ChangeLog (see [yankee: Changelogs meet YAML](https://github.com/studio-b12/yankee)).

## Software

* [mtail](https://packages.debian.org/stable/mtail).
* [Scrapy | A Fast and Powerful Scraping and Web Crawling Framework](https://scrapy.org/).
* [phantomjs in stretch](https://packages.debian.org/stable/phantomjs).
* [wpull](https://wpull.readthedocs.io/en/master/usage.html).
* [Darktable - virtual lighttable and darkroom for photographers](https://packages.debian.org/stable/darktable).
* OsmAnd and GPX tracks.

## API, bigdata, etc

* https://stripe.com/blog/idempotency
* https://botman.io
* https://github.com/metabase/metabase
* [Apache Drill](https://drill.apache.org/), [presto](https://github.com/prestodb/presto), hadoop, etc.
* [Redash](https://redash.io/).
* [TensorFlow](https://www.tensorflow.org/).
* [Wikidata](https://www.wikidata.org).
* [Swagger Specification](http://swagger.io/specification/).