From 23ac9f57b9b4c761cb8edc5bfa0c0de77ec89326 Mon Sep 17 00:00:00 2001
From: Silvio Rhatto
Date: Sat, 30 Sep 2017 14:06:22 -0300
Subject: Change extension to .md

---
 research/data.md | 28 ++++++++++++++++++++++++++++
 1 file changed, 28 insertions(+)
 create mode 100644 research/data.md

(limited to 'research/data.md')

diff --git a/research/data.md b/research/data.md
new file mode 100644
index 0000000..777661f
--- /dev/null
+++ b/research/data.md
@@ -0,0 +1,28 @@
+[[!meta title="Data science, lean databases and formats"]]
+
+## Basic
+
+* Ontologies and how to deal with lists.
+* Standards: schema.org, microdata, microformats, json, yaml, csv, dot, vcard.
+* Intelligence: how to easily search, index and produce outputs with structured data?
+* Samples: TODO and ChangeLog (see [yankee: Changelogs meet YAML](https://github.com/studio-b12/yankee)).
+
+## Software
+
+* [mtail](https://packages.debian.org/stable/mtail).
+* [Scrapy | A Fast and Powerful Scraping and Web Crawling Framework](https://scrapy.org/).
+* [phantomjs in stretch](https://packages.debian.org/stable/phantomjs).
+* [wpull](https://wpull.readthedocs.io/en/master/usage.html).
+* [Darktable - virtual lighttable and darkroom for photographers](https://packages.debian.org/stable/darktable).
+* OsmAnd and GPX tracks.
+
+## API, bigdata, etc
+
+* https://stripe.com/blog/idempotency
+* https://botman.io
+* https://github.com/metabase/metabase
+* [Apache Drill](https://drill.apache.org/), [presto](https://github.com/prestodb/presto), hadoop, etc.
+* [Redash](https://redash.io/).
+* [TensorFlow](https://www.tensorflow.org/).
+* [Wikidata](https://www.wikidata.org).
+* [Swagger Specification](http://swagger.io/specification/).
-- 
cgit v1.2.3