PHP provides two functions for parsing URLs: parse_url and parse_str.
parse_url: parse a URL and return its component parts
mixed parse_url ( string $url [, int $component = -1 ] )
This function parses a URL and returns an associative array containing the various components that appear in the URL.
This function is not used to verify the validity of the given URL, but to break it down into the parts listed below. Incomplete URLs are also accepted and parse_url() will try to parse them as correctly as possible.
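As a small illustration of how incomplete URLs are handled, the sketch below parses a string with no scheme and a protocol-relative URL. The example hostnames and variable names are made up for this demonstration.

```php
<?php
// No scheme and no leading '//': parse_url() cannot recognise a host,
// so everything before the '?' ends up in 'path'.
$noScheme = parse_url('www.example.com/path?arg=value');

// A protocol-relative URL: the leading '//' lets the host be recognised.
$relative = parse_url('//www.example.com/path?arg=value');

print_r($noScheme);
print_r($relative);
```

In the first case the array has only path and query keys; in the second it has host, path and query. Code that expects a host should therefore check that the key exists before using it.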
Parameters
url The URL to parse. Invalid characters will be replaced with _.
component Specify one of PHP_URL_SCHEME, PHP_URL_HOST, PHP_URL_PORT, PHP_URL_USER, PHP_URL_PASS, PHP_URL_PATH, PHP_URL_QUERY or PHP_URL_FRAGMENT to retrieve just that part of the URL as a string (except for PHP_URL_PORT, which is returned as an integer).
Return value
For seriously malformed URLs, parse_url() may return FALSE.
If the component parameter is omitted, an associative array is returned, containing at least one element. The possible keys in the array are:
scheme (e.g. http)
host
port
user
pass
path
query (the part after the question mark ?)
fragment (the part after the hash mark #)
If the component argument is specified, parse_url() returns a string (or an integer when specified as PHP_URL_PORT) instead of an array. If the specified component in the URL does not exist, NULL will be returned.
Example
<?php
$url = 'http://username:password@hostname/path?arg=value#anchor';

print_r(parse_url($url));

echo parse_url($url, PHP_URL_PATH);
?>
The above example will output:
Array
(
    [scheme] => http
    [host] => hostname
    [user] => username
    [pass] => password
    [path] => /path
    [query] => arg=value
    [fragment] => anchor
)
/path
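The return-value rules described earlier (an integer for the port, NULL for a missing component, FALSE for a seriously malformed URL) can be checked with a short sketch. The URLs and variable names here are illustrative only.

```php
<?php
$url = 'http://example.com:8080/index.php';

$port  = parse_url($url, PHP_URL_PORT);  // integer, not a string
$host  = parse_url($url, PHP_URL_HOST);  // string
$query = parse_url($url, PHP_URL_QUERY); // NULL: this URL has no query string

// An empty host after the scheme is rejected outright.
$bad = parse_url('http:///example.com');

var_dump($port, $host, $query, $bad);
```

Because a missing component yields NULL while a malformed URL yields FALSE, a strict comparison (===) is the safest way to tell the two cases apart.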
parse_str: parse a string into variables
void parse_str ( string $str [, array &$arr ] )
Parses str as if it were the query string passed via a URL and sets the resulting variables in the current scope.
To get the current QUERY_STRING, you can use the $_SERVER['QUERY_STRING'] variable.
Parameters
str The input string.
arr If the second parameter arr is given, the variables are stored in this array as array elements instead. Calling parse_str() without arr is deprecated as of PHP 7.2 and no longer allowed as of PHP 8.0.
Example
<?php
$str = "first=value&arr[]=foo+bar&arr[]=baz";

// Without a second argument the variables are set in the current scope
// (deprecated in PHP 7.2, no longer allowed in PHP 8.0)
parse_str($str);
echo $first;  // value
echo $arr[0]; // foo bar
echo $arr[1]; // baz

// Recommended: collect the variables in an array
parse_str($str, $output);
echo $output['first'];  // value
echo $output['arr'][0]; // foo bar
echo $output['arr'][1]; // baz
?>
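To show a little more of what the array form handles, here is a minimal sketch with URL-encoded values and bracketed keys. The query string and the variable names are invented for the example.

```php
<?php
$query = 'name=John+Doe&tags[]=php&tags[]=url&filter[min]=1&filter[max]=9';

// Always pass the second argument so nothing leaks into the local scope.
parse_str($query, $params);

print_r($params);
```

The '+' is decoded to a space, tags[] becomes a list, and filter[min] / filter[max] become a nested associative array. All leaf values are strings, so numeric options need an explicit cast.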
I was reading the source code of php-resque some time ago and noticed both functions in use. They are put to good use there for parsing the Redis connection settings.
The format of a Redis DSN is: redis://user:pass@host:port/db?option1=val1&option2=val2. It has the same shape as a URL, so it is easy to parse with the two functions above.
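Before looking at the library code, here is a minimal sketch of how the two functions split such a DSN. The DSN value, the option names and the variable names are made up for illustration, and this is not the php-resque implementation.

```php
<?php
$dsn = 'redis://user:secret@localhost:6379/2?timeout=5&persistent=1';

// parse_url() splits the DSN into scheme, host, port, user, pass, path and query.
$parts = parse_url($dsn);

// The database index is the path without its leading slash.
$database = (int) ltrim($parts['path'], '/');

// parse_str() turns the query string into an options array.
parse_str($parts['query'], $options);

print_r($parts);
print_r($options);
```

Here the path component is '/2', which is why the database number needs a small extra step, and the option values come back as strings.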
Address: https://github.com/chrisboulton/php-resque/blob/master/lib/Resque/Redis.php
The code is as follows:
/**
 * Parse a DSN string, which can have one of the following formats:
 *
 * - host:port
 * - redis://user:pass@host:port/db?option1=val1&option2=val2
 * - tcp://user:pass@host:port/db?option1=val1&option2=val2
 *
 * Note: the 'user' part of the DSN is not used.
 *
 * @param string $dsn A DSN string
 * @return array An array of DSN components, with 'false' values for any unknown components. e.g.
 *               [host, port, db, user, pass, options]
 */
public static function parseDsn($dsn)
{
    if ($dsn == '') {
        // Use a sensible default for an empty DSN string
        $dsn = 'redis://' . self::DEFAULT_HOST;
    }
    $parts = parse_url($dsn);

    // Check the URI scheme
    $validSchemes = array('redis', 'tcp');
    if (isset($parts['scheme']) && ! in_array($parts['scheme'], $validSchemes)) {
        throw new \InvalidArgumentException("Invalid DSN. Supported schemes are " . implode(', ', $validSchemes));
    }

    // Allow simple 'hostname' format, which `parse_url` treats as a path, not host.
    if ( ! isset($parts['host']) && isset($parts['path'])) {
        $parts['host'] = $parts['path'];
        unset($parts['path']);
    }

    // Extract the port number as an integer
    $port = isset($parts['port']) ? intval($parts['port']) : self::DEFAULT_PORT;

    // Get the database from the 'path' part of the URI
    $database = false;
    if (isset($parts['path'])) {
        // Strip non-digit chars from path
        $database = intval(preg_replace('/[^0-9]/', '', $parts['path']));
    }

    // Extract any 'user' and 'pass' values
    $user = isset($parts['user']) ? $parts['user'] : false;
    $pass = isset($parts['pass']) ? $parts['pass'] : false;

    // Convert the query string into an associative array
    $options = array();
    if (isset($parts['query'])) {
        // Parse the query string into an array
        parse_str($parts['query'], $options);
    }

    return array(
        $parts['host'],
        $port,
        $database,
        $user,
        $pass,
        $options,
    );
}
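Two steps in that method are easy to miss, so here is a small standalone sketch of them: the bare-hostname fallback and the database extraction. It reuses the same expressions outside the class. The input values are illustrative.

```php
<?php
// A bare hostname has no scheme or '//' prefix, so parse_url() reports it as a path.
$parts = parse_url('localhost');

// Same fallback as in parseDsn(): promote the path to the host.
if (!isset($parts['host']) && isset($parts['path'])) {
    $parts['host'] = $parts['path'];
    unset($parts['path']);
}

// Database extraction: strip every non-digit from a path such as '/2'.
$database = intval(preg_replace('/[^0-9]/', '', '/2'));

print_r($parts);
var_dump($database);
```

After the fallback, $parts contains only the host key, so the later path check is skipped and the database stays at its default.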