- Plugin type: output
- Load all or nothing: no
- Resume supported: no
- Cleanup supported: yes
- path_prefix: A prefix of the output path. (string, required)
  - support: file, s3, s3n, and s3a.
- file_ext: An extension of the output file. (string, default: .orc)
- sequence_format: (string, default: .%03d)
- buffer_size: Set the ORC buffer size. (integer, default: 262144 (256KB))
- strip_size: Set the ORC stripe size. (integer, default: 67108864 (64MB))
- block_size: Set the ORC block size. (integer, default: 268435456 (256MB))
- compression_kind: Compression kind of the ORC file. (string, default: ZLIB)
  - NONE, ZLIB, SNAPPY, LZO, LZ4
- overwrite: Overwrite if output files already exist. (boolean, default: false)
  - Support: LocalFileSystem, S3 (s3, s3a, s3n)
- default_from_timezone: Time zone of timestamp columns. This can be overridden for each column using column_options. (DateTimeZone, default: UTC)
- auth_method: Name of the mechanism to authenticate requests (basic, env, instance, profile, properties, anonymous, or session; default: basic).
  see: https://github.com/embulk/embulk-input-s3#configuration
  - env, basic, profile, default, session, anonymous, properties
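
As a rough sketch of how the options above fit together, the following output section writes out the documented defaults explicitly. The path is a placeholder, and spelling out the defaults is optional; it is done here only to make the values and their units visible in one place.

```yaml
out:
  type: orc
  path_prefix: "/tmp/output"   # placeholder path, replace with your own
  file_ext: ".orc"
  sequence_format: ".%03d"
  buffer_size: 262144          # 256KB
  strip_size: 67108864         # 64MB
  block_size: 268435456        # 256MB
  compression_kind: ZLIB
  overwrite: false
  default_from_timezone: UTC
```

Omitting any of these keys except path_prefix should give the same behavior, since each value matches the default listed above.
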

out:
  type: orc
  path_prefix: "/tmp/output"
  compression_kind: ZLIB
  overwrite: true

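
For output to S3, a configuration along the following lines may serve as a starting point. The bucket name and key prefix are hypothetical, and the comment on auth_method assumes this plugin follows the behavior described in the embulk-input-s3 documentation linked above; check that documentation before relying on it.

```yaml
out:
  type: orc
  # Hypothetical bucket and key prefix; s3, s3n, and s3a are the listed schemes.
  path_prefix: "s3a://example-bucket/embulk/output"
  compression_kind: SNAPPY
  overwrite: true
  # Assumption: as in embulk-input-s3, env reads credentials from
  # environment variables instead of the configuration file.
  auth_method: env
```

Using env keeps credentials out of the configuration file, which is one reason to prefer it over the default basic method when the file is kept under version control.
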
- Bump orc library to 1.5.4
- bugfix
- bugfix
- Bump orc library to 1.4.4
- Update orc libraries to 1.4.3
- Change default values: block_size, buffer_size, strip_size
  - The default values are now Hive's default values. (see: https://orc.apache.org/docs/hive-config.html)
- support: output to s3
  - s3n, s3a protocol
- initial release
$ ./gradlew gem # -t to watch for file changes and rebuild continuously