Commit c7f0475

Reorder and add Kubernetes (#106)
1 parent 8fa38e5 commit c7f0475

1 file changed: +27 -28 lines changed

on_the_web/index.md

@@ -3,57 +3,56 @@ layout: default
 title: On The Web
 ---

-<p><a href="https://github.com/pytest-dev/pytest-reportlog">pytest-reportlog</a> is a pytest plugin which writes testing report data in JSON lines format</p>
-
-<p><a href="https://github.com/ndjson/ndjson-spec">NDJSON</a> is a similar format that also allows blank lines</p>
+<p><a href="https://airbyte.io">Airbyte</a> is an open-source data integration tool that uses JSON Lines to communicate between containerized source applications that pull data from files/APIs/databses and containerized destination applications that write data to warehouses.</p>

-<p><a href="https://okfnlabs.org/blog/2014/09/01/bubbles-python-etl.html">Bubbles</a> supports JSON Lines datastores</p>
+<p><a href="https://spark.apache.org">Apache Spark</a> uses JSONL for reading and writing JSON data.</p>

-<p><a href="https://logstash.net">Logstash</a> supports JSON Lines via the <a href="https://logstash.net/docs/latest/codecs/json_lines">json_lines codec</a></p>
+<p><a href="https://arangodb.com">ArangoDB</a> is an open source multi-model database. The JSON lines format allows to import huge amounts of documents sequentially (via <a href="https://www.arangodb.com/docs/stable/programs-arangoimport-examples-json.html">arangoimport</a>).</p>

-<p><a href="https://plot.ly/streaming/">plot.ly</a> uses JSON Lines for its streaming data API</p>
+<p><a href="https://cloud.google.com/bigquery">BigQuery</a> uses JSON Lines as one of the supported formats to load data into the database.</p>

-<p><a href="https://www.graylog.org/resources/gelf/">Graylog GELF</a> is format for log messages, their stream is de-facto JSON lines.</p>
+<p><a href="https://bkl.gopatchy.io/">BKL</a> is a layered templating configuration tool that supports JSON Lines input and output.</p>

-<p><a href="https://www.scrapy.org">Scrapy</a> is a framework for web scraping &amp; crawling, it supports and recommends JSON lines since long -- <a href="https://twitter.com/pablohoffman/status/547385799093022723">it might've even coined the term</a>.</p>
+<p><a href="https://okfnlabs.org/blog/2014/09/01/bubbles-python-etl.html">Bubbles</a> supports JSON Lines datastores</p>

 <p><a href="https://clickhouse.yandex/">ClickHouse</a> is an open source column-oriented DBMS. It supports JSON lines as <a href="https://clickhouse.yandex/reference_en.html#JSONEachRow">JSONEachRow</a> format for input and output.</p>

-<p><a href="https://dataflowkit.com">Dataflow kit</a> is a web scraping open source framework written in Go. JSON Lines is one of the supported formats for storing results.</p>
+<p><a href="https://www.htmlvalidator.com/">CSS HTML Validator for Windows v22.0211+</a> now supports JSON Lines syntax checking.</p>

-<p><a href="https://github.com/dart-lang/test/blob/master/pkgs/test/doc/json_reporter.md">dart</a> uses JSON Lines as one of the possible reporters when running tests.</p>
+<p><a href="https://github.com/dart-lang/test/blob/master/pkgs/test/doc/json_reporter.md">Dart</a> uses JSON Lines as one of the possible reporters when running tests.</p>

-<p><a href="https://spark.apache.org">Apache Spark</a> uses JSONL for reading and writing JSON data.</p>
+<p><a href="https://dataflowkit.com">Dataflow kit</a> is a web scraping open source framework written in Go. JSON Lines is one of the supported formats for storing results.</p>

-<p><a href="https://arangodb.com">ArangoDB</a> is an open source multi-model database. The JSON lines format allows to import huge amounts of documents sequentially (via <a href="https://www.arangodb.com/docs/stable/programs-arangoimport-examples-json.html">arangoimport</a>).</p>
+<p><a href="https://pkg.go.dev/encoding/json#NewEncoder">Go Standard library's json.Encoder</a> will produce JSON lines by default. The decoder parses Concatenated JSON, which is compatible with, though less strict than, JSON lines</p>

-<p><a href="https://www.rumbledb.org/">Rumble</a> is a JSONiq engine that runs on top of Spark. It can process datasets in the JSON lines format that have billions of objects and more.</p>
+<p><a href="https://github.com/simonfrey/jsonl">Golang JSONL library</a></p>

-<p><a href="https://neo4j.com/developer">Neo4j</a> the open-source graph database supports JSONL export and import via its standard library procedures <a href="https://neo4j.com/labs/apoc/4.1/export/json/#export-database-json"><code>apoc.export/import.json</code></a> to allow stream processing of nodes and relationships.</p>
+<p><a href="https://www.graylog.org/resources/gelf/">Graylog GELF</a> is format for log messages, their stream is de-facto JSON lines.</p>

-<p><a href="https://github.com/petl-developers/petl">petl</a> is a general purpose Python package for extracting, transforming and loading tables of data. It allows importing and exporting documents/records between many <a href="https://petl.readthedocs.io/en/stable/io.html#databases">databases</a> and file formats, including <a href="https://petl.readthedocs.io/en/stable/io.html#json-files">JSON lines</a>, in local and <a href="https://petl.readthedocs.io/en/stable/io.html#remote-i-o-helper-classes">remote</a> filesystems and clouds.</p>
+<p><a href="https://kubernetes.io">Kubernetes (k8s)</a> is an open-source container orchestration system. It uses JSON Lines as format for <a href="https://kubernetes.io/docs/tasks/debug/debug-cluster/audit/">auditing</a>.</p>

-<p><a href="https://cloud.google.com/bigquery">BigQuery</a> uses JSON Lines as one of the supported formats to load data into the database.</p>
+<p><a href="https://logstash.net">Logstash</a> supports JSON Lines via the <a href="https://logstash.net/docs/latest/codecs/json_lines">json_lines codec</a></p>

-<p><a href="https://airbyte.io">Airbyte</a> is an open-source data integration tool that uses JSON Lines to communicate between containerized source applications that pull data from files/APIs/databses and containerized destination applications that write data to warehouses.</p>
+<p><a href="https://mattermost.com/">Mattermost</a> is an open-source, self-hostable online chat service. It uses JSON Lines as the format for <a href="https://docs.mattermost.com/onboard/bulk-loading-data.html">bulk data migration</a> on self-hosted instances.</p>

-<p><a href="https://shopify.dev/api/usage/bulk-operations/queries">Shopify GraphQL Bulk Operations API</a>, designed for very large data exports from Shopify stores, returns results in the form of a JSONL file.</p>
+<p><a href="https://miller.readthedocs.io/en/latest/file-formats/#json-lines">Miller</a> supports JSON Lines format as input.</p>

-<p><a href="https://www.htmlvalidator.com/">CSS HTML Validator for Windows v22.0211+</a> now supports JSON Lines syntax checking.</p>
-
-<p><a href="https://pkg.go.dev/encoding/json#NewEncoder">Go Standard library's json.Encoder</a> will produce JSON lines by default. The decoder parses Concatenated JSON, which is compatible with, though less strict than, JSON lines</p>
+<p><a href="https://github.com/ndjson/ndjson-spec">NDJSON</a> is a similar format that also allows blank lines</p>

-<p><a href="https://github.com/simonfrey/jsonl">Golang JSONL library</a></p>
+<p><a href="https://neo4j.com/developer">Neo4j</a> the open-source graph database supports JSONL export and import via its standard library procedures <a href="https://neo4j.com/labs/apoc/4.1/export/json/#export-database-json"><code>apoc.export/import.json</code></a> to allow stream processing of nodes and relationships.</p>

-<p><a href="https://miller.readthedocs.io/en/latest/file-formats/#json-lines">Miller</a> supports JSON Lines format as input.</p>
+<p><a href="https://petl.readthedocs.io/en/stable/io.html#databases">petl</a> is a general purpose Python package for extracting, transforming and loading tables of data. It allows importing and exporting documents/records between many databases and file formats, including <a href="https://petl.readthedocs.io/en/stable/io.html#json-files">JSON lines</a>, in local and <a href="https://petl.readthedocs.io/en/stable/io.html#remote-i-o-helper-classes">remote</a> filesystems and clouds.</p>

-<p><a href="https://mattermost.com/">Mattermost</a> is an open-source, self-hostable online chat service. It uses JSON Lines as the format for <a href="https://docs.mattermost.com/onboard/bulk-loading-data.html">bulk data migration</a> on self-hosted instances.</p>
+<p><a href="https://php-jsonl">php-jsonl</a> is a PHP library for reading &amp; writing JSON Lines documents, taking advantage of the streaming benefits.</p>

-<p><a href="https://docs.mattermost.com/onboard/bulk-loading-data.html">Mattermost</a> is a collaboration tool and uses the JSON Lines format for bulk data import.</p>
+<p><a href="https://plot.ly/streaming/">plot.ly</a> uses JSON Lines for its streaming data API</p>

-<p><a href="https://github.com/jwodder/serde-jsonlines">serde-jsonlines</a> is a Rust library for reading &amp; writing JSON Lines documents.</p>
+<p><a href="https://github.com/pytest-dev/pytest-reportlog">pytest-reportlog</a> is a pytest plugin which writes testing report data in JSON lines format</p>

-<p><a href="https://github.com/indykoning/php-jsonl">php-jsonl</a> is a PHP library for reading &amp; writing JSON Lines documents, taking advantage of the streaming benefits.</p>
+<p><a href="https://www.rumbledb.org/">Rumble</a> is a JSONiq engine that runs on top of Spark. It can process datasets in the JSON lines format that have billions of objects and more.</p>

-<p><a href="https://bkl.gopatchy.io/">BKL</a> is a layered templating configuration tool that supports JSON Lines input and output.</p>
+<p><a href="https://www.scrapy.org">Scrapy</a> is a framework for web scraping &amp; crawling, it supports and recommends JSON lines since long -- <a href="https://twitter.com/pablohoffman/status/547385799093022723">it might've even coined the term</a>.</p>

+<p><a href="https://github.com/jwodder/serde-jsonlines">serde-jsonlines</a> is a Rust library for reading &amp; writing JSON Lines documents.</p>
+
+<p><a href="https://shopify.dev/api/usage/bulk-operations/queries">Shopify GraphQL Bulk Operations API</a>, designed for very large data exports from Shopify stores, returns results in the form of a JSONL file.</p>
