Filter¶
Overview¶
Sanitizing user input is a critical part of software development. Trusting or neglecting to sanitize user input could lead to unauthorized access to the content of your application, mainly user data, or even the server your application is hosted on.
Sanitizing content can be achieved using the Phalcon\Filter\Filter and Phalcon\Filter\FilterFactory classes.
FilterFactory¶
This component creates a new locator with predefined filters attached to it. Each filter is lazy-loaded for maximum performance. To instantiate the factory and retrieve the Phalcon\Filter\Filter with the preset sanitizers you need to call newInstance()
<?php
use Phalcon\Filter\FilterFactory;
$factory = new FilterFactory();
$locator = $factory->newInstance();
You can now use the locator wherever you need and sanitize content as per the needs of your application.
Filter¶
The Phalcon\Filter\Filter component implements a locator service and can be used as a stand-alone component, without initializing the built-in filters.
<?php
use MyApp\Sanitizers\HelloSanitizer;
use Phalcon\Filter\Filter;
$services = [
'hello' => HelloSanitizer::class,
];
$locator = new Filter($services);
$text = $locator->hello('World');
NOTE
The Phalcon\Di\FactoryDefault container already has a Phalcon\Filter\Filter object loaded with the predefined sanitizers. The component can be accessed using the filter
name.
Built-in¶
NOTE
Where appropriate, the sanitizers will cast the value to the type expected. For example, the absint
sanitizer will remove all non-numeric characters from the input, cast the input to an integer, and return its absolute value.
NOTE
To use the predefined filters, you will need to obtain an instance of Phalcon\Filter\Filter
using the Phalcon\Filter\FilterFactory
as shown above.
The following are the built-in filters provided by this component:
absint
¶
Removes any non-numeric characters, casts the value to an integer, and returns its absolute value. Internally it uses filter_var for the integer part, intval for casting, and absint. alnum
¶
Removes all characters that are not numbers or characters of the alphabet. It uses preg_replace which can also accept arrays of strings as the parameters. alpha
¶
Removes all characters that are not characters of the alphabet. It uses preg_replace which can also accept arrays of strings as the parameters. bool
¶
Casts the value to a boolean. It also returns true
if the value is:
true
on
yes
y
1
It also returns false
if the value is:
false
off
no
n
0
email
¶
Removes all characters except letters, digits and !#$%&*+-/=?^_`{\|}~@.[]
. Internally it uses filter_var with FILTER_FLAG_EMAIL_UNICODE
. float
¶
Removes all characters except digits, dots, plus and minus signs and casts the value as a double
. Internally it uses filter_var and (double)
. int
¶
Remove all characters except digits, plus and minus sign, and casts the value as an integer. Internally it uses filter_var and (int)
. lower
¶
Converts all characters to lowercase. If the mbstring extension is loaded, it will use mb_convert_case to perform the transformation. As a fallback, it uses the strtolower PHP function, with utf8_decode. lowerFirst
¶
Converts the first character of the input to lowercase. Internally it uses lcfirst. regex
¶
Performs a regex replacement on the input using a pattern
and the replace
parameter. Internally it uses preg_replace. remove
¶
Performs a replacement on the input, replacing the replace
parameter with an empty string, effectively removing it. Internally it uses str_replace. replace
¶
Performs a replacement on the input based on the from
and to
passed parameters. Internally it uses str_replace. special
¶
Escapes all HTML characters of the input, as well as '"<>&
, and characters with ASCII values less than 32. Internally it uses filter_var. specialFull
¶
Converts all the special characters of the input to HTML entities (both double and single quotes). Internally it uses filter_var. string
¶
Encode HTML entities. Internally it uses [htmlspecialchars][htmlspecialchars]. stringlegacy
¶
This filter will work only for PHP versions lower than 8.1. It is available for backward compatibility. Internally it uses filter_var. striptags
¶
Removes all HTML and PHP tags from the input. Internally it uses strip_tags. trim
¶
Removes all leading and trailing whitespace from the input. Internally it uses trim. upper
¶
Converts all characters to uppercase. If the mbstring extension is loaded, it will use mb_convert_case to perform the transformation. As a fallback, it uses the strtoupper PHP function, with utf8_decode. upperFirst
¶
Converts the first character of the input to upper case. Internally it uses ucfirst. upperWords
¶
Converts into uppercase the first character of each word from the input. Internally it uses ucwords. url
¶
Sanitizes a URL. Internally it uses filter_var. Constants¶
Constants are available and can be used to define the type of sanitizing required:
<?php
const FILTER_ABSINT = 'absint';
const FILTER_ALNUM = 'alnum';
const FILTER_ALPHA = 'alpha';
const FILTER_BOOL = 'bool';
const FILTER_EMAIL = 'email';
const FILTER_FLOAT = 'float';
const FILTER_INT = 'int';
const FILTER_LOWER = 'lower';
const FILTER_LOWERFIRST = 'lowerfirst';
const FILTER_REGEX = 'regex';
const FILTER_REMOVE = 'remove';
const FILTER_REPLACE = 'replace';
const FILTER_SPECIAL = 'special';
const FILTER_SPECIALFULL = 'specialfull';
const FILTER_STRING = 'string';
const FILTER_STRING_LEGACY = 'stringlegacy';
const FILTER_STRIPTAGS = 'striptags';
const FILTER_TRIM = 'trim';
const FILTER_UPPER = 'upper';
const FILTER_UPPERFIRST = 'upperfirst';
const FILTER_UPPERWORDS = 'upperwords';
const FILTER_URL = 'url';
Methods¶
The Phalcon\Filter\Filter acts as a service locator and implements the __call()
method. As a result, you can use any filter as a method directly on the locator. The names of the methods are the same as the ones defined by the constants. To use the built-in filters, you will need to obtain an instance of Phalcon\Filter\Filter
using the Phalcon\Filter\FilterFactory
.
<?php
use Phalcon\Filter\FilterFactory;
$factory = new FilterFactory();
$filter = $factory->newInstance();
$source = -123;
echo $filter->absint($source); // 123
<?php
$filter->absint(mixed $input): int
$filter->alnum(mixed $input): string
$filter->alpha(mixed $input): string
$filter->bool(mixed $input): bool
$filter->email(string $input): string
$filter->float(mixed $input): float
$filter->int(string $input): int
$filter->lower(string $input): string
$filter->lowerfirst(string $input): string
$filter->regex(mixed $input, mixed $pattern, mixed $replace): mixed
$filter->remove(mixed $input, mixed $replace): mixed
$filter->replace(mixed $input, mixed $source, mixed $target): mixed
$filter->special(string $input): string
$filter->specialfull(string $input): string
$filter->string(string $input): string
$filter->stringlegacy(mixed $input): string
$filter->striptags(string $input): string
$filter->trim(string $input): string
$filter->upper(string $input): string
$filter->upperfirst(string $input): string
$filter->upperwords(string $input): string|null
$filter->url(string $input): string|null
```
## Sanitizing Data
Sanitizing is the process that removes specific characters from a value, that are not required or desired by the user or application. By sanitizing input, we ensure that application integrity will be intact.
```php
<?php
use Phalcon\Filter\FilterFactory;
$factory = new FilterFactory();
$locator = $factory->newInstance();
// 'someone@example.com'
$locator->sanitize('some(one)@exa\mple.com', 'email');
// 'hello'
$locator->sanitize('hello<<', 'string');
// '100019'
$locator->sanitize('!100a019', 'int');
// '100019.01'
$locator->sanitize('!100a019.01a', 'float');
Controllers¶
You can access the Phalcon\Filter\Filter object from your controllers when accessing GET
or POST
input data (through the request object). The first parameter is the name of the variable to be obtained; the second is the sanitizer to be applied on it. The second parameter can also be an array with any number of sanitizers that you want to apply.
<?php
use Phalcon\Filter\Filter;
use Phalcon\Http\Request;
use Phalcon\Mvc\Controller;
/**
* Class ProductsController
*
* @property Request $request
*/
class ProductsController extends Controller
{
public function saveAction()
{
if (true === $this->request->isPost()) {
$price = $this->request->getPost('price', 'double');
$email = $this->request->getPost(
'customerEmail',
Filter::FILTER_EMAIL
);
}
}
}
Action Parameters¶
If you have used the Phalcon\Di\FactoryDefault as your DI container, the Phalcon\Filter\Filter is already registered for you with the default sanitizers. To access it we can use the name filter
. If you do not use the Phalcon\Di\FactoryDefault container, you will need to set the service up in it, so that it can be accessible in your controllers.
We can sanitize values passed into controller actions as follows:
<?php
use Phalcon\Filter\Filter;
use Phalcon\Mvc\Controller;
/**
* Class ProductsController
*
* @property Filter $filter
*/
class ProductsController extends Controller
{
public function showAction($productId)
{
// $productId = $this->filter->sanitize($productId, Filter::FILTER_ABSINT);
$productId = $this->filter->sanitize($productId, 'absint');
}
}
Filtering Data¶
The Phalcon\Filter\Filter both filters and sanitizes data, depending on the sanitizers used. For instance, the trim
sanitizer will remove all leading and trailing whitespace, leaving the remaining input unchanged. The description of each sanitizer (see Built-in Sanitizers) can help you to understand and use the sanitizers according to your needs.
<?php
use Phalcon\Filter\FilterFactory;
$factory = new FilterFactory();
$locator = $factory->newInstance();
// 'Hello'
$locator->sanitize('<h1>Hello</h1>', 'striptags');
// 'Hello'
$locator->sanitize(' Hello ', 'trim');
Adding Sanitizers¶
You can add your own sanitizers to Phalcon\Filter\Filter. The sanitizer can be an anonymous function when initializing the locator:
<?php
use Phalcon\Filter\Filter;
$services = [
'md5' => function ($input) {
return md5($input);
},
];
$locator = new Filter($services);
$sanitized = $locator->sanitize($value, 'md5');
If you already have an instantiated filter locator object (for instance if you have used the Phalcon\Filter\FilterFactory and newInstance()
), then you can simply add the custom filter:
<?php
use Phalcon\Filter\FilterFactory;
$factory = new FilterFactory();
$locator = $factory->newInstance();
$locator->set(
'md5',
function ($input) {
return md5($input);
}
);
$sanitized = $locator->sanitize($value, 'md5');
Or, if you prefer, you can implement the filter in a class:
<?php
use Phalcon\Filter\FilterFactory;
class IPv4
{
public function __invoke($value)
{
return filter_var($value, FILTER_VALIDATE_IP, FILTER_FLAG_IPV4);
}
}
$factory = new FilterFactory();
$locator = $factory->newInstance();
$locator->set(
'ipv4',
new Ipv4()
);
// Sanitize with the 'ipv4' filter
$filteredIp = $locator->sanitize('127.0.0.1', 'ipv4');
Combining Sanitizers¶
There are times when one sanitizer is not enough for your data. For instance, a very common usage is the striptags
and trim
sanitizers for text input. The Phalcon\Filter\Filter component offers the ability to accept an array of names for sanitizers to be applied to the input value. The following example demonstrates this:
<?php
use Phalcon\Filter\FilterFactory;
$factory = new FilterFactory();
$locator = $factory->newInstance();
// Returns 'Hello'
$locator->sanitize(
' <h1> Hello </h1> ',
[
'striptags',
'trim',
]
);
Note that this feature also works on the Phalcon\Http\Request object, when calling methods to retrieve data from GET
and POST
, namely getQuery()
and getPost()
.
<?php
use Phalcon\Filter\Filter;
use Phalcon\Http\Request;
use Phalcon\Mvc\Controller;
/**
* Class ProductsController
*
* @property Request $request
*/
class ProductsController extends Controller
{
public function saveAction()
{
if (true === $this->request->isPost()) {
$message = $this->request->getPost(
' <h1> Hello </h1> ',
[
'striptags',
'trim',
]
);
}
}
}
Custom Sanitizer¶
A custom sanitizer can be implemented as an anonymous function. If however, you prefer to use a class per sanitizer, all you need to do is make it callable by implementing the __invoke method with the relevant parameters.
<?php
use Phalcon\Filter\FilterFactory;
$factory = new FilterFactory();
$locator = $factory->newInstance();
$locator->set(
'md5',
function ($input) {
return md5($input);
}
);
$sanitized = $locator->sanitize($value, 'md5');
Or, if you prefer, you can implement the sanitizer in a class:
<?php
use Phalcon\Filter\FilterFactory;
class IPv4
{
public function __invoke($value)
{
return filter_var($value, FILTER_VALIDATE_IP, FILTER_FLAG_IPV4);
}
}
$factory = new FilterFactory();
$locator = $factory->newInstance();
$locator->set(
'ipv4',
function () {
return new Ipv4();
}
);
// Sanitize with the 'ipv4' filter
$filteredIp = $locator->sanitize('127.0.0.1', 'ipv4');