| filter_containing | | Filter rows containing any or all of a number of specified values |
| filter_duplicate_nodes | | Remove duplicate nodes in a network |
| filter_duplicates | | Filter duplicate rows, keeping the first or last of each set of duplicates found only |
| filter_missing | | Filter rows based on missing values in one or more columns |
| filter_range | | Filter rows based on the numeric values in a given column |
| filter_row_numbers | | Filter rows by row number |
| filter_rows | ⚡ | Filter rows using graphext’s advanced query syntax (similar to Elasticsearch) |
| filter_sample | | Randomly sample the dataset, optionally within groups (can be used to balance a dataset) |
| filter_topn | | Sort a dataset by selected columns and pick the first N rows (or exclude them) |
| filter_values | | Filter rows where column matches specified values exactly |
| filter_with_formula | | Filter rows using a (pandas-compatible) formula |
| upsample | | Upsample a dataset given a weight column |