EXAMPLE - IPTOINT Function

This examples illustrates how you can convert IP addresses to numeric values for purposes of comparison and sorting. This example illustrates the following functions:

  • IPTOINT - converts an IP address to an integer value according to a formula. See IPTOINT Function.
  • IPFROMINT - converts an integer value back to an IP address according to formula. See IPFROMINT Function.

Source:

Your dataset includes the following values for IP addresses:

IpAddr
192.0.0.1
10.10.10.10
1.2.3.4
1.2.3
http://12.13.14.15
https://16.17.18.19

Transformation:

When the above data is imported, the application initially types the column as URL values, due to the presence of the http:// and https:// protocol identifiers. Select the IP Address data type for the column. The last three values are listed as mismatched values. You can fix the issues with the last two entries by applying the following transform, which matches on both http:// and https:// strings:

Transformation Name Replace text or pattern
Parameter: Column IpAddr
Parameter: Find `http%?://`
Parameter: Replace with ''

NOTE: The %? Cloud Dataprep pattern matches zero or one time on any character, which enables the matching on both variants of the protocol identifier.

Now, only the 1.2.3 value is mismatched. Perhaps you know that there is a missing zero at the end of it. To add it back, you can do the following:

Transformation Name Replace text or pattern
Parameter: Column IpAddr
Parameter: Find `1.2.3[end]`
Parameter: Replace with '1.2.3.0'
Parameter: Match all occurrences true

All values in the column should be valid for the IP Address data type. To convert these values to their integer equivalents:

Transformation Name New formula
Parameter: Formula type Single row formula
Parameter: Formula IPTOINT(IpAddr)
Parameter: New column name 'ip_as_int'

You can now manipulate the data based on this numeric key. To convert the integer values back to IP addresses for checking purposes, use the following:

Transformation Name New formula
Parameter: Formula type Single row formula
Parameter: Formula IPFROMINT(ip_as_int)
Parameter: New column name 'ip_check'

Results:

Xip_as_intip_check
192.0.0.13221225473192.0.0.1
10.10.10.1016843009010.10.10.10
1.2.3.4169090601.2.3.4
1.2.3.0169090561.2.3.0
12.13.14.1520218215912.13.14.15
16.17.18.1926955419516.17.18.19