EVL – ETL Tool

Products, services and company names referenced in this document may be either trademarks or registered trademarks of their respective owners.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.3 or any later version published by the Free Software Foundation; with no Invariant Sections, with no Front-Cover Texts, and with no Back-Cover Texts.

Introduction
Release Notes
- Version 1.0
- Version 1.1
- Version 1.2
- Version 1.3
- Version 2.0
- Version 2.1
- Version 2.2
- Version 2.3
- Version 2.4
- Version 2.5
- Version 2.6
- Version 2.7
Installation and Settings
- Linux – RPM
- Linux – DEB
- Other Unix systems
- Settings
  - Compiler
  - Project
- Text Editor
  - Vim
EVL Overview
- EVL Jobs
- EVL Workflows
- Scheduling
Main EVL Command
- Usage
- Examples
- Options
- Environment
- evl project
- evl run
- evl workflow
EVD and Data Types
- EVD Structure
- EVD Options
- Default Values
- Compound Types
- String
- Integral Types
- Decimal
  - Declaration in mapping
  - Manipulation, comparison
- Float and Double
- Date and Time
Components Common
- Common Options
Basic Components
- Assign
- Cat
- Cmd
- Component
- Cut
- Departition
- Echo
- Filter
- Gather
- Generate
- Head
- Lookup
- Merge
- Partition
- Sort
- Sortgroup
- Tail
- Tee
- Trash
- Uniq
- Validate
- Watcher
Mapping Components
- Aggreg
- Join
- Map
Read Components
- Read
- Readevd
- Readjson
- Readkafka
- Readmysql
- Readora
- Readparquet
- Readpg
- Readqvd
- Readtd
- Readxls
- Readxlsx
- Readxml
Run SQL Components
- Runmysql
- Runora
- Runpg
Write Components
- Write
- Writeevd
- Writejson
- Writekafka
- Writeora
- Writeparquet
- Writepg
- Writeqvd
- Writeqvx
- Writetd
- Writexlsx
- Writexml
- Writemysql
Commands
- Cancel
- Cp
- Chmod
- Crontab
- End
- Fr
- Log
- Ls
- Mail
- Manager
- Mkdir
- Mv
- Rm
- Set
- Skip
- Sleep
- Spark
- Status
- Test
- Wait
EVM Mappings
- Output Functions
- String Functions
- Checksum Functions
- IP Addresses Functions
  - IPv4 Functions
  - IPv6 Functions
- Randomization Functions
- Anonymization Functions
Joins and Lookups
- Lookup tables
  - Declaration and load
  - Methods
Utils
- csv2evd
- csv2qvd
- evd2sql
- guess-timestamp-format
- json2evd
- pg2evd
- qvd2csv
- qvd2evd
- evl_increment_run_id
- qvd-header
EVM Functions Index
EVD Data Types Index
Variables Index
General Index

Departition

(since EVL 1.2)

Gather or merge partitions into one output flow or file. When ‘-k <key>’ is specified, then sorted input of each partition is supposed and output will be again sorted (i.e. merged). With no ‘-k <key>’, it gather input partitions in round-robin fashion. Applying to only one partition simply write input to output. EVD is EVL data definition file, for details see evl-evd(5).

Synopsis

Departition
  <f_in>... <f_out> (<evd>|-d <inline_evd>)
  (--key=<key> | --round-robin)
  [--validate] [-x|--text-input] [-y|--text-output]

evl departition
  <file_in> <file_out> (<evd>|-d <inline_evd>)
  (--key=<key> | --round-robin)
  [-v|--validate] [-x|--text-input] [-y|--text-output]
  [-v|--verbose]

evl departition
  ( --help | --usage | --version )

Options

-d, --data-definition=<inline_evd>: either this option or the file <evd> must be presented. Example: -d ’id int, user_id string(6) enc=iso-8859-1’
-k, --key=<key>: merge partitioned flows/files according to the key, so the output is sorted by this key
-r, --round-robin: gather in round-robin fashion
--validate: without this option, no fields are checked against data types. With this option, all output fields are checked
-x, --text-input: suppose the input as text, not binary
-y, --text-output: write the output as text, not binary

Standard options:

--help: print this help and exit
--usage: print short usage information and exit
-v, --verbose: print to stderr info/debug messages of the component
--version: print version and exit

Examples

To departition partitioned flow in the EVL job: