且构网

分享程序员开发的那些事...
且构网 - 分享程序员编程开发的那些事

使用python从文本文件解析IP地址/网络

更新时间:2023-01-17 16:39:27

首先,你的正则表达式甚至不会尝试捕获除了四个虚线数字之外的任何东西,所以当然它不会与其他任何东西相匹配,比如最后的 / 32 。如果你只是添加,例如 / \d {1,2} 到最后,它将修复:

First, your regex doesn't even attempt to capture anything but four dotted numbers, so of course it's not going to match anything else, like a /32 on the end. if you just add, e.g., /\d{1,2} to the end, it'll fix that:

(?:\d{1,3}\.){3}\d{1,3}/\d{1,2}

Debuggex演示

但是,如果你不能理解正则表达式,那么你可能不应该理解使用正则表达式作为你永远无法调试或扩展的魔法。使用 str 等方法更加冗长,如 split find ,但对新手来说可能更容易理解:

However, if you don't understand regular expressions well enough to understand that, you probably shouldn't be using a regex as a piece of "magic" that you'll never be able to debug or extend. It's a bit more verbose with str methods like split or find, but maybe easier to understand for a novice:

for line in file:
    for part in line.split()
        try:
            address, network = part.split('/')
            a, b, c, d = address.split('.')
        except ValueError:
            pass # not in the right format
        else:
            # do something with part, or address and network, or whatever






作为附注,根据您对这些事情的实际操作,您可能想要使用 ipaddress 模块(或在PyPI上的backport 2.6-3.2)而不是字符串解析:


As a side note, depending on what you're actually doing with these things, you might want to use the ipaddress module (or the backport on PyPI for 2.6-3.2) rather than string parsing:

>>> import ipaddress
>>> s = '10.1.1.1/32'
>>> a = ipaddress.ip_network('10.1.1.1/32')

您可以将其与以上:

for line in file:
    for part in line.split():
        try:
            a = ipaddress.ip_network(part)
        except ValueError:
            pass # not the right format
        else:
            # do something with a and its nifty methods