pyspark.sql.functions.
trunc
Returns date truncated to the unit specified by the format.
New in version 1.5.0.
Column
‘year’, ‘yyyy’, ‘yy’ or ‘month’, ‘mon’, ‘mm’
Examples
>>> df = spark.createDataFrame([('1997-02-28',)], ['d']) >>> df.select(trunc(df.d, 'year').alias('year')).collect() [Row(year=datetime.date(1997, 1, 1))] >>> df.select(trunc(df.d, 'mon').alias('month')).collect() [Row(month=datetime.date(1997, 2, 1))]