Related: Arrow Flight SQL (a protocol for interacting with SQL databases, built on top of Arrow Flight)
A new general-purpose client-server framework to simplify high-performance transport of large datasets over network interfaces.
- Flight is initially focused on optimized transport of the Arrow columnar format (i.e. “Arrow record batches”) over gRPC, Google’s popular HTTP/2-based general-purpose RPC library and framework. While we have focused on integration with gRPC, as a development framework Flight is not intended to be exclusive to gRPC.
One of the biggest features that sets Flight apart from other data transport frameworks is:
- parallel transfers: allowing data to be streamed to or from a cluster of servers simultaneously.
This enables developers to more easily create scalable data services that can serve a growing client base.
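
The parallel-transfer pattern can be sketched roughly as follows. This is a minimal, standard-library-only simulation, not real Flight client code: the `CLUSTER` dictionary, the node names, and the helper functions `get_flight_info`, `do_get`, and `fetch_parallel` are all hypothetical stand-ins. They only mirror the idea behind Flight's GetFlightInfo and DoGet RPCs, where a client first asks which endpoints hold a dataset and then pulls from each of them concurrently.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-in for a cluster: each "endpoint" holds one partition
# of the dataset as a column-oriented batch (column name -> list of values).
CLUSTER = {
    "node-a": {"id": [1, 2], "value": [10.0, 20.0]},
    "node-b": {"id": [3, 4], "value": [30.0, 40.0]},
    "node-c": {"id": [5], "value": [50.0]},
}


def get_flight_info(dataset):
    """Simulated GetFlightInfo: list the endpoints that hold `dataset`.

    In this sketch every node holds a partition, so `dataset` is unused.
    """
    return sorted(CLUSTER)


def do_get(endpoint):
    """Simulated DoGet: fetch the batch stored at a single endpoint."""
    return CLUSTER[endpoint]


def fetch_parallel(dataset):
    """Fetch all partitions concurrently and concatenate them by column."""
    endpoints = get_flight_info(dataset)
    with ThreadPoolExecutor(max_workers=len(endpoints)) as pool:
        # pool.map returns results in the same order as `endpoints`.
        batches = list(pool.map(do_get, endpoints))

    columns = {}
    for batch in batches:
        for name, values in batch.items():
            columns.setdefault(name, []).extend(values)
    return columns


result = fetch_parallel("example-dataset")
print(result)
```

Because each endpoint is fetched independently, adding servers adds transfer capacity without changing the client logic; a real client would replace the two simulated functions with calls to a Flight client library.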
# Motivation
Our design goal for Flight is to create a new protocol for data services that uses the Arrow columnar format as both the over-the-wire data representation and the public API presented to developers.
In doing so, we reduce or remove the serialization costs associated with data transport and increase the overall efficiency of distributed data systems. Additionally, two systems that are already using Apache Arrow for other purposes can communicate data to each other with extreme efficiency. Source
It’s an alternative to ODBC and JDBC. Implementations of standard protocols like ODBC generally implement their own custom on-wire binary protocols that must be marshalled to and from each library’s public interface. The performance of ODBC or JDBC libraries varies greatly from case to case.
# Questions
- What’s the difference between Flight and IPFS?
Origin: Overview — Arrow DataFusion documentation
References:
Created 2022-10-26
