Newer Version Available
foreach
Syntax
1q = foreach q generate expression as alias[, expression as alias ...];The output column names are specified with the as keyword. The output data is ungrouped.
Using foreach with Ungrouped Data
When used with ungrouped data, the foreach statement maps the input rows to output rows. The number of rows remains the same.
Example
Using foreach with Grouped Data
When used with grouped data, the foreach statement behaves differently than it does with ungrouped data.
Fields can be directly accessed only when the value is the same for all group members. For example, the fields that were used as the grouping keys have the same value for all group members. Otherwise, use aggregate functions to access the members of a group. The type of the column determines which aggregate functions you can use. For example, if the column type is numeric, you can use the sum() function.
Example
Using foreach with a case Expression
Example
1q = load "data";
2q = foreach q generate xInt, (case xInt % 3
3 when 0 then "3n"
4 when 1 then "3n+1"
5 else "3n+2"
6end) as modThree;Example
1q = load "data";
2q = foreach q generate price, (case
3 when price < 1000 then "category1"
4 when price >= 1000 and price < 2000 then "category2"
5 else "category3"
6end) as priceLevel;Use Unique Names
Using a name multiple times in a projection throws an error.
1l = load "0Fabb000000002qCAA/0Fabb000000002WCAQ";
2r = load "0Fcyy000000002qCAA/0Fcyy000000002WCAQ";
3l = foreach l generate 'value'/'divisor' as 'value' , category as category;
4r = foreach r generate 'value'/'divisor' as 'value' , category as category;
5cg = cogroup l by category right, r by category;
6cg = foreach cg generate r.category as 'category', sum(r.value) as sumrval, sum(l.value) as sumrval;