pig基本语法——filter distinct foreach

1、主要参考:

https://book.itxueyuan.com/3b7D/9PxP

 

filter主要用户过滤数据,根据条件查询

基础数据:

==============================================

[root@cdh1 data]# cat demodata

xiaoxiao,12,12.1f

aaa,13,1.1f

kjkj,12,12.1f

ddf,19,12.8f

youyou,89,12.3f

==============================================

 

grunt> A = load '/root/xytest/pig/data/demodata' using PigStorage(',') as (name:chararray,age:int,gpa:float);

grunt> B = filter A by age > 16;

grunt> dump B;

 

输出结果:

(ddf,19,12.8)

(youyou,89,12.3)

 

 

==============================================

主要参考:

https://book.itxueyuan.com/3b7D/gJEW

distinct主要用于排重

 

==============================================

数据如下:

aa

bb

aa

bb

cc

==============================================

 

grunt> A = load '/root/xytest/pig/data/demodata3' using PigStorage();

grunt> B = distinct A;

grunt> dump B;

输出结果如下:

(aa)

(bb)

(cc)

 

==============================================

主要参考:

https://book.itxueyuan.com/3b7D/bDZv

foreach主要用户获取指定的数据集

==============================================

[root@cdh1 data]# cat demodata

xiaoxiao,12,12.1f

aaa,13,1.1f

kjkj,12,12.1f

ddf,19,12.8f

youyou,89,12.3f

==============================================

 

grunt> A = load '/root/xytest/pig/data/demodata' using PigStorage(',') as (name:chararray,age:int,gpa:float);

grunt> B = foreach A generate name,gpa;

grunt> dump B;

 

运行结果:

(xiaoxiao,12.1)

(aaa,1.1)

(kjkj,12.1)

(ddf,12.8)

(youyou,12.3)


本文来自互联网用户投稿,文章观点仅代表作者本人,不代表本站立场,不承担相关法律责任。如若转载,请注明出处。 如若内容造成侵权/违法违规/事实不符,请点击【内容举报】进行投诉反馈!

相关文章

立即
投稿

微信公众账号

微信扫一扫加关注

返回
顶部