Filtering rows

In the previous page we learned how select, derive, and join change the columns of a table.

Now we will explore how to manipulate the rows of a table using filter and take.

`filter` transform

The filter transform picks rows to pass through based on their values:

from invoices
filter billing_city == "Berlin"

The resulting table contains all the rows that came from Berlin.

PRQL converts the single filter transform to use the appropriate SQL WHERE or HAVING command, depending on where it appears in the pipeline.

`take` transform

The take transform picks rows to pass through based on their position within the table. The set of rows picked can be specified in two ways:

a plain number x, which will pick the first x rows, or
an inclusive range of rows start..end.

from invoices
take 4

from invoices
take 4..7

Of course, it is possible to combine all these transforms into a single pipeline:

from invoices

# retain only rows for orders from Berlin
filter billing_city == "Berlin"

# skip first 10 rows and take the next 10
take 11..20

# take only first 3 rows of *that* result
take 3

We did something a bit odd at the end: first we took rows 11..20 and then took the first 3 rows from that result.

Note

Note that a single transform take 11..13 would have produced the same SQL. The example shows how PRQL allows fast data exploration by “stacking” transforms in the pipeline. This reduces the cognitive burden: unlike SQL, each new transform interacts only with the results of the previous query.

Keyboard shortcuts

PRQL language book

Filtering rows

filter transform

take transform

`filter` transform

`take` transform