Getting parts of a URL (Regex)

Views: 44 | Published: 2021/12/2 23:24:46 | Tags: regex, language-agnostic, url

Given the URL (single line):
http://test.example.com/dir/subdir/file.html

How can I extract the following parts using regular expressions:

The regex should work correctly even if I enter the following URL:

http://example.example.com/example/example/example.html

A single regex to parse and break up a full URL, including query parameters and anchors:

^((http[s]?|ftp):\/)?\/?([^:\/\s]+)((\/\w+)*\/)([\w\-\.]+[^#?\s]+)(.*)?(#[\w\-]+)?$

RegEx positions:

url: RegExp['$&'],
protocol: RegExp.$2,
host: RegExp.$3,
path: RegExp.$4,
file: RegExp.$6,
query: RegExp.$7,
hash: RegExp.$8
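As a rough illustration of the group positions above, here is a minimal Python sketch. It assumes the pattern is meant to be written with its backslash escapes (\/, \s, \w), and the names URL_PATTERN and split_url are made up for this example, not taken from the answer.

```python
import re

# The single-regex pattern from the answer, written with backslash escapes.
URL_PATTERN = re.compile(
    r"^((http[s]?|ftp):\/)?\/?([^:\/\s]+)((\/\w+)*\/)"
    r"([\w\-\.]+[^#?\s]+)(.*)?(#[\w\-]+)?$"
)


def split_url(url):
    """Return the captured parts as a dict, or None if the URL does not match."""
    m = URL_PATTERN.match(url)
    if m is None:
        return None
    return {
        "url": m.group(0),       # whole match, like RegExp['$&']
        "protocol": m.group(2),  # RegExp.$2
        "host": m.group(3),      # RegExp.$3
        "path": m.group(4),      # RegExp.$4
        "file": m.group(6),      # RegExp.$6
        "query": m.group(7),     # RegExp.$7
        "hash": m.group(8),      # RegExp.$8
    }


parts = split_url("http://test.example.com/dir/subdir/file.html")
print(parts["host"])  # test.example.com
print(parts["path"])  # /dir/subdir/
print(parts["file"])  # file.html
```

One thing to watch: group 7 is a greedy (.*), so for a URL that has both a query string and an anchor it appears to swallow the anchor as well, leaving group 8 empty.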

You could then further parse the host ('.' delimited) quite easily.
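Splitting the host on '.' might look like the sketch below. The helper name split_host is hypothetical, and treating the last two labels as the registered domain is a simplifying assumption: it breaks for suffixes such as co.uk.

```python
def split_host(host):
    """Split a host name into (subdomain, domain).

    Assumption: the domain is the last two dot-separated labels.
    """
    labels = host.split(".")
    if len(labels) <= 2:
        return "", host
    return ".".join(labels[:-2]), ".".join(labels[-2:])


print(split_host("test.example.com"))  # ('test', 'example.com')
```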

What I would do is use something like this:

/*
    ^(.*:)//([A-Za-z0-9\-\.]+)(:[0-9]+)?(.*)$
*/
proto $1
host $2
port $3
the-rest $4
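For illustration, the coarse split above could be tried in Python roughly as follows. The character class is written with escaped '-' and '.', and the names COARSE_PATTERN and split_coarse are placeholders chosen for this sketch.

```python
import re

# Coarse pattern: protocol, host, optional port, and everything else.
COARSE_PATTERN = re.compile(r"^(.*:)//([A-Za-z0-9\-\.]+)(:[0-9]+)?(.*)$")


def split_coarse(url):
    """Return proto/host/port/the_rest, or None if the URL does not match."""
    m = COARSE_PATTERN.match(url)
    if m is None:
        return None
    return {
        "proto": m.group(1),     # includes the trailing ':'
        "host": m.group(2),
        "port": m.group(3),      # includes the leading ':', or None if absent
        "the_rest": m.group(4),
    }


result = split_coarse("http://test.example.com/dir/subdir/file.html")
print(result["proto"])     # http:
print(result["host"])      # test.example.com
print(result["the_rest"])  # /dir/subdir/file.html
```

Note that the protocol and port groups keep their ':' delimiters, so a caller would probably strip those before using the values.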

Then further parse 'the rest' to be as specific as possible. Doing it in one regex is, well, a bit crazy.
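One possible way to break down 'the rest' without another large regex is plain string splitting. This is only a sketch: the function name split_rest and the returned keys are invented here, and it assumes the first '#' starts the anchor and the first '?' before it starts the query.

```python
def split_rest(rest):
    """Split e.g. '/dir/subdir/file.html?a=1#top' into its pieces."""
    # Everything after the first '#' is treated as the anchor.
    before_hash, _, fragment = rest.partition("#")
    # Everything after the first '?' (before the anchor) is the query.
    path, _, query = before_hash.partition("?")
    # The file is whatever follows the last '/' in the path.
    directory, slash, filename = path.rpartition("/")
    return {
        "dir": directory + slash,
        "file": filename,
        "query": query,
        "hash": fragment,
    }


pieces = split_rest("/dir/subdir/file.html?a=1#top")
print(pieces["dir"])    # /dir/subdir/
print(pieces["file"])   # file.html
print(pieces["query"])  # a=1
print(pieces["hash"])   # top
```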
