Apache Pig - GetYear()
This function accepts a date-time object as parameter and returns the current year from the given date-time object.
Syntax
Here is the syntax of the GetYear() function.
grunt> GetYear(datetime)
Example
Assume that there is a file named date.txt in the HDFS directory /pig_data/ as shown below. This file contains the date-of-birth details of a particular person, it has person id, date and time.
date.txt
001,1989/09/26 09:00:00 002,1980/06/20 10:22:00 003,1990/12/19 03:11:44
And, we have loaded this file into Pig with a relation named date_data as shown below.
grunt> date_data = LOAD 'hdfs://localhost:9000/pig_data/date.txt' USING PigStorage(',') as (id:int,date:chararray);
Following is an example of the GetYear() function. It will retrive the current year from the given date-time object. Therefore, First of all let’s generate the date-time objects of all employees using todate() function as shown below.
grunt> todate_data = foreach date_data generate ToDate(date,'yyyy/MM/dd HH:mm:ss') as (date_time:DateTime ); grunt> Dump todate_data; (1989-09-26T09:00:00.000+05:30) (1980-06-20T10:22:00.000+05:30) (1990-12-19T03:11:44.000+05:30)
Let us now get the year from the date-of-birth of each employee using the GetYear() function and store it in the relation named getyear_data.
grunt> getyear_data = foreach todate_data generate (date_time), GetYear(date_time);
Now verify the contents of the getyear_data relation using Dump operator as shown below.
grunt> Dump getyear_data; (1989-09-26T09:00:00.000+05:30,1989) (1980-06-20T10:22:00.000+05:30,1980) (1990-12-19T03:11:44.000+05:30,1990)