EVL – ETL Tool

Products, services and company names referenced in this document may be either trademarks or registered trademarks of their respective owners.

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.3 or any later version published by the Free Software Foundation; with no Invariant Sections, with no Front-Cover Texts, and with no Back-Cover Texts.

Introduction
Release Notes
- Version 1.0
- Version 1.1
- Version 1.2
- Version 1.3
- Version 2.0
- Version 2.1
- Version 2.2
- Version 2.3
- Version 2.4
- Version 2.5
- Version 2.6
- Version 2.7
Installation and Settings
- Linux – RPM
- Linux – DEB
- Other Unix systems
- Settings
  - Compiler
  - Project
- Text Editor
  - Vim
EVL Overview
- EVL Jobs
- EVL Workflows
- Scheduling
Main EVL Command
- Usage
- Examples
- Options
- Environment
- evl project
- evl run
- evl workflow
EVD and Data Types
- EVD Structure
- EVD Options
- Default Values
- Compound Types
- String
- Integral Types
- Decimal
  - Declaration in mapping
  - Manipulation, comparison
- Float and Double
- Date and Time
Components Common
- Common Options
Basic Components
- Assign
- Cat
- Cmd
- Component
- Cut
- Departition
- Echo
- Filter
- Gather
- Generate
- Head
- Lookup
- Merge
- Partition
- Sort
- Sortgroup
- Tail
- Tee
- Trash
- Uniq
- Validate
- Watcher
Mapping Components
- Aggreg
- Join
- Map
Read Components
- Read
- Readevd
- Readjson
- Readkafka
- Readmysql
- Readora
- Readparquet
- Readpg
- Readqvd
- Readtd
- Readxls
- Readxlsx
- Readxml
Run SQL Components
- Runmysql
- Runora
- Runpg
Write Components
- Write
- Writeevd
- Writejson
- Writekafka
- Writeora
- Writeparquet
- Writepg
- Writeqvd
- Writeqvx
- Writetd
- Writexlsx
- Writexml
- Writemysql
Commands
- Cancel
- Cp
- Chmod
- Crontab
- End
- Fr
- Log
- Ls
- Mail
- Manager
- Mkdir
- Mv
- Rm
- Set
- Skip
- Sleep
- Spark
- Status
- Test
- Wait
EVM Mappings
- Output Functions
- String Functions
- Checksum Functions
- IP Addresses Functions
  - IPv4 Functions
  - IPv6 Functions
- Randomization Functions
- Anonymization Functions
Joins and Lookups
- Lookup tables
  - Declaration and load
  - Methods
Utils
- csv2evd
- csv2qvd
- evd2sql
- guess-timestamp-format
- json2evd
- pg2evd
- qvd2csv
- qvd2evd
- evl_increment_run_id
- qvd-header
EVM Functions Index
EVD Data Types Index
Variables Index
General Index

Aggreg

(since EVL 1.0)

Applies aggregation mapping on each group of records based on the <key>.

Aggreg: is intended for use in an EVS job structure definition file. <f_in> and <f_out> are the input and output, respectively; each is either a file name or a flow name.
evl aggreg: is intended for standalone use, i.e. invoked from the command line, reading records from standard input and writing to standard output.

EVD, EVM and EVS are EVL definition files; for details, see evl-evd(5), evl-evm(5) and evl-evs(5).
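The idea of aggregating each group of records that share a key can be sketched with standard Unix tools. This is a conceptual stand-in only, not EVL code: the `sum_by_key` function, the `key,amount` line format and the sample data are all assumptions made for illustration. It relies on the input being sorted by key, which is also what the --check-sort option verifies for Aggreg.

```shell
#!/usr/bin/env bash
# Conceptual illustration only; this does not call EVL.
# Assumption: each input line is "key,amount" and lines are sorted by key.
sum_by_key() {
  awk -F, '
    # Key changed: emit the finished group and reset the accumulator.
    NR > 1 && $1 != prev { print prev "," total; total = 0 }
    # Accumulate the amount for the current group.
    { prev = $1; total += $2 }
    # Emit the last group, if there was any input at all.
    END { if (NR > 0) print prev "," total }
  '
}

# Sort by the first field, then aggregate group by group.
printf '%s\n' 'b,5' 'a,1' 'a,2' | LC_ALL=C sort -t, -k1,1 | sum_by_key
# prints:
# a,3
# b,5
```

Because a group is emitted as soon as the key changes, only one group is held in memory at a time; unsorted input would split a key into several groups.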

Synopsis

Aggreg
  <f_in> <f_out> (<evd_in>|-D <inline_evd>) (<evd_out>|-d <inline_evd>)
  <evm> --key=<key>
  [-c|--check-sort] [-i|--ignore-case] [-x|--text-input] [-y|--text-output]
  [-o <f_out>] [--output<n>=<f_out>]... [--outputs=<varname>]
  [--reject <f_out>] [--reject<n>=<f_out>]... [--rejects=<varname>]

evl aggreg
  (<evd_in>|-D <inline_evd>) (<evd_out>|-d <inline_evd>)
  <evm> --key=<key>
  [-c|--check-sort] [-i|--ignore-case] [-x|--text-input] [-y|--text-output]
  [-o|--output <f_out>] [--output<n>=<f_out>]... [--outputs=<varname>]
  [-r|--reject <f_out>] [--reject<n>=<f_out>]... [--rejects=<varname>]
  [-v|--verbose]

evl aggreg
  ( --help | --usage | --version )

Options

-c, --check-sort: check that the input is actually sorted according to the specified key
-D, --input-definition=<inline_evd>: either this option or the file <evd_in> must be present. Example: ‘-D 'id int, user_id string'’
-d, --output-definition=<inline_evd>: either this option or the file <evd_out> must be present. Example: ‘-d 'user_sum long'’
-i, --ignore-case: be case insensitive for key fields
-k, --key=<key>: group by this key, where <key> is a comma-separated list of fields, each optionally followed by a sort order (DESC or ASC; the default is ASC). Example: ‘--key='id,user_id DESC'’
-o, --output=<f_out>: when the output() function is used in the mapping, the out structure is forwarded to <f_out>
--output<n>=<f_out>: when the function ‘output(<n>)’ is used in the mapping, where <n> is an integer from 4 to 16, the out structure is forwarded to <f_out>
--outputs=<varname>: specifies an array ‘${<varname>[@]}’ which contains the filenames to be used for output(N) functions in the mapping. Example: for ‘--outputs=OUTFILE’, ‘${OUTFILE[120]}’ is the filename used for ‘output(120)’
-r, --reject=<f_out>: when the reject() function is used in the mapping, the input record is rejected into <f_out>
--reject<n>=<f_out>: when the function reject(<n>) is used in the mapping, where <n> is an integer from 4 to 16, the input record is rejected into <f_out>
--rejects=<varname>: specifies an array ‘${<varname>[@]}’ which contains the filenames to be used for reject(N) functions in the mapping. Example: for ‘--rejects=REJECTS’, ‘${REJECTS[1000]}’ is the filename used for ‘reject(1000)’
-x, --text-input: treat the input as text, not binary
-y, --text-output: write the output as text, not binary
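The --outputs and --rejects options refer to a shell array by name. A minimal Bash sketch of how such an array could be populated follows; the file paths are hypothetical, and where the variable has to be defined so that EVL sees it is not covered here.

```shell
#!/usr/bin/env bash
# Hypothetical output paths; only the ${OUTFILE[n]} indexing mirrors the option text.
# Bash indexed arrays are sparse, so index 120 can be set without filling 0..119.
OUTFILE=()
OUTFILE[3]="/tmp/out_3.txt"
OUTFILE[120]="/tmp/out_120.txt"

# With --outputs=OUTFILE, output(120) in the mapping would refer to this entry:
echo "${OUTFILE[120]}"
# Indices that are actually set:
echo "${!OUTFILE[@]}"
```

The same pattern would apply to an array passed with --rejects, for example ‘${REJECTS[1000]}’ for ‘reject(1000)’.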

Standard options:

--help: print this help and exit
--usage: print short usage information and exit
-v, --verbose: print the component's info/debug messages to stderr
--version: print version and exit

Examples

To print the average of the amount values to stdout:

evl aggreg -D 'amount int' -d 'avg int' average.evm -k '' -xy <in.txt

File ‘average.evm’ might look like this: