Window
Returns the total number of records for the specified expression.
Returns the population covariance for non-NULL pairs across all input values.
Returns the sample covariance for non-NULL pairs across all input values.
Returns the cumulative distribution of the current row with regard to other values within the same window partition.
Returns the rank of the current row within its partition and ordering. Rows that are equal will have the same rank.
Returns the first value within an ordered group of a result set.
Uses HyperLogLog to return an approximation of the distinct cardinality of the input.
Returns the row before the current one in a partition based on the ORDER BY
clause without the need for a self-join. If there are no rows, this function returns NULL
.
Returns the row after the current one in the same result set without the need for a self-join. If there are no rows, this function returns NULL
.
Returns the maximum value among the non-NULL input expressions.
Returns the minimum value among the non-NULL input expressions.
Returns an approximate distinct value number, similar to COUNT(DISTINCT col)
. NDV can return results faster than using the combination of COUNT and DISTINCT while using a constant amount of memory, resulting in less memory usage for columns with high cardinality.
Equally splits the rows in each partition into ranked parts specified by the integer value and starting from 1. This function requires the ORDER BY
clause.
Returns the relative rank of the current row in the partition based on the ORDER BY
clause. The displayed percentage ranges from 0.0 to 1.0.
Returns the rank of the current row within its partition and placement order. Rows that are equal have the same rank. However, the count of tied rows is added to the next rank, instead of being incremented by one. The rank value starts at 1 and increases sequentially.
Returns the row number for the current row based on the ORDER BY
clause within each partition. Rows containing identical values receive different row numbers.
Returns the sum of non-NULL input expressions.
Returns the population variance of non-NULL records.
Returns the sample variance of non-NULL records.
Window Function Syntax
A window function performs a calculation across a set of table rows that has some relationship to the current row. This is comparable to how an aggregate function can run a calculation. The difference is that a window function does not group rows into a single output row. With a window function, the rows retain their separate identities.
A window function call uses the OVER()
clause directly following the window function’s name and argument(s). The OVER()
clause may use the following optional arguments:
PARTITION BY
: Defines multiple window partitions.ORDER BY
: Orders rows within each partition.
Syntax
Aggregate Window Functions
The OVER()
clause can be used with regular aggregate functions such as:
AVG
COUNT
MAX
MIN
SUM
Example
The following example uses the sample table provided below to show the OVER()
clause used with the SUM
aggregate function.
Product1
A
30.0
30.0
Product2
A
24.0
54.0
Product3
A
2.0
56.0
Product3
B
45.0
45.0
Product2
B
10.0
55.0
Product1
B
3.0
58.0
General-Purpose Window Functions
The OVER()
clause can be used with the following functions:
CUME_DIST()
Double
Calculates the cumulative distribution of the current row within the window partition.
DENSE_RANK()
BIGINT
Returns the rank of the current row within its partition and ordering. Rows that are equal have the same rank.
LAG()
Same as input
Returns the row before the current one in a partition. If there are no rows, returns null.
LEAD()
Same as input
Returns the row after the current one in a partition. If there are no rows, returns null.
NTILE([integer] ntile)
Integer
NTILE function equally splits the rows in each partition into N ranked parts. Has to be used with an ORDER BY
clause.
PERCENT_RANK()
Double
Returns the percent rank of the current row in the partition based on the order by clause.
RANK()
BIGINT
Returns the rank of the current row within its partition and ordering. Rows that are equal have the same rank. However, the count of tied rows is added to the next rank, instead of being incremented by just one.
ROW_NUMBER()
BIGINT
Returns the row number for the current row based on the order by clause within each partition.
For more information about Window Functions, see SQL Window Functions.
Last updated