mongodb-arrow-connector

MongoDB connector that reads and writes data to/from Apache Arrow


Keywords
mongodb, apache-arrow, arrow, rust
License
Apache-2.0

Documentation

MongoDB Apache Arrow Connector

A Rust library for reading and writing Apache Arrow batches from and to MongoDB.

Licensed under the Apache 2.0 license.

Motivation

We are curently writing this library due to a need to read MongoDB data into dataframes.

Features

  • Read from a collection to batches
  • Write from batches to a collection
  • Infer collection schema
  • Projection predicate push-down
  • Filter predicate push-down
  • Data types
    • Primitive types that MongoDB supports
    • List types
    • Nested structs (bson::Document)
    • Arbitrary binary data