EXAMPLE - KTHLARGEST Function

This example explores how you can use aggregation functions to calculate rank of values in a column using the KTHLARGEST and KTHLARGESTUNIQUE functions.

Source:

You have a set of student test scores:

StudentScore
Anna84
Ben71
Caleb76
Danielle87
Evan85
Faith92
Gabe87
Hannah99
Ian73
Jane68

Transform:

You can use the following transforms to extract the 1st through 4th-ranked scores on the test:

derive type:single value:KTHLARGEST(Score, 1) as: '1st'

derive type:single value:KTHLARGEST(Score, 2) as: '2nd'

derive type:single value:KTHLARGEST(Score, 3) as: '3rd'

derive type:single value:KTHLARGEST(Score, 4) as: '4th'

derive type:single value:KTHLARGESTUNIQUE(Score, 3) as: '3rdUnique'

derive type:single value:KTHLARGESTUNIQUE(Score, 4) as: '4thUnique'

Results:

When you reorganize the columns, the dataset might look like the following:

StudentScore1st2nd3rd4th3rdUnique4thUnique
Anna84999287878785
Ben71999287878785
Caleb76999287878785
Danielle87999287878785
Evan85999287878785
Faith92999287878785
Gabe87999287878785
Hannah99999287878785
Ian73999287878785
Jane68999287878785

Notes:

  • The value 87 is both the third and fourth scores.
    • For the KTHLARGEST function, it is the output for the third and fourth ranking.
    • For the KTHLARGESTUNIQUE function, it is the output for the third ranking only.
Was this page helpful? Let us know how we did:

Send feedback about...

Google Cloud Dataprep Documentation
Need help? Visit our support page.