网站链接正则表达式 stackoverflow (1)

📌 相关文章

📜 网站链接正则表达式 stackoverflow (1)

📅 最后修改于: 2023-12-03 14:57:01.574000 🧑 作者: Mango

网站链接正则表达式

正则表达式是一种用于匹配字符串的强大工具。在编程中，经常使用正则表达式来提取、搜索或替换字符串。在本文中，我们将介绍如何使用正则表达式来匹配网站链接。

匹配 HTTP 和 HTTPS 链接

匹配 HTTP 链接

下面是一个匹配 HTTP 链接的正则表达式：

http:\/\/[a-zA-Z0-9\-\.]+\.[a-zA-Z]{2,3}(\/\S*)?

该正则表达式将匹配以下 HTTP 链接：

http://www.example.com
http://www.example.com/
http://www.example.com/index.html
http://www.example.com/documents/document.pdf

匹配 HTTPS 链接

下面是一个匹配 HTTPS 链接的正则表达式：

https:\/\/[a-zA-Z0-9\-\.]+\.[a-zA-Z]{2,3}(\/\S*)?

该正则表达式将匹配以下 HTTPS 链接：

https://www.example.com
https://www.example.com/
https://www.example.com/index.html
https://www.example.com/documents/document.pdf

匹配其他协议的链接

如果你需要匹配其他协议的链接，只需将 HTTP 或 HTTPS 替换为你所需的协议即可。

匹配不包含协议的链接

如果你需要匹配不包含协议的链接，可以使用以下正则表达式：

([a-zA-Z0-9\-\.]+\.[a-zA-Z]{2,3})(\/\S*)?

该正则表达式将匹配以下链接：

www.example.com
www.example.com/
www.example.com/index.html
www.example.com/documents/document.pdf

在程序中使用正则表达式

以下是一个使用 Python 正则表达式模块 re 来匹配 HTTP 链接的示例代码：

import re

string = "Visit my website at http://www.example.com"
regex = r'http:\/\/[a-zA-Z0-9\-\.]+\.[a-zA-Z]{2,3}(\/\S*)?'

matches = re.findall(regex, string)

for match in matches:
    print(match)

该代码将输出：

http://www.example.com

总结

本文介绍了如何使用正则表达式来匹配网站链接。我们为 HTTP 链接、HTTPS 链接和不包含协议的链接分别提供了正则表达式示例。无论你选用哪一种，记得在程序中使用相关的正则表达式模块进行匹配。