且构网

分享程序员开发的那些事...
且构网 - 分享程序员编程开发的那些事

遍历FTP列表

更新时间:2023-11-05 21:07:22

这是一个天真而缓慢的实施。它很慢,因为它试图对每个目录条目进行CWD以确定它是目录还是文件,但是这是有效的。可以通过解析LIST命令输出来优化它,但这是强烈依赖于服务器实现的。

  import ftplib 

def traverse(ftp,depth = 0):

返回一个ftp服务器内容的递归列表(从当前目录开始


列表作为递归字典返回,其中每个键
包含子目录的内容,如果
与文件相对应,则包含无。

@param ftp:ftplib。 FTP对象

if depth> 10:
return ['depth> ('。','..')):
尝试:如果路径不在(ftp.nlst()路径中,
ftp.cwd(entry)
level [entry] =遍历(ftp,depth + 1)
ftp.cwd('..')
除了ftplib.error_perm:
level [entry] = None
返回级别

def main():
ftp = ftplib.FTP(localhost)
ftp.connect )
ftp.login()
ftp.set_pasv(True)

print traverse(ftp)

if __name__ =='__main__':
main()


I am trying to to get all directories' name from an FTP server and store them in hierarchical order in a multidimensional list or dict

So for example, a server that contains the following structure:

/www/
    mysite.com
        images
            png
            jpg

at the end of the script, would give me a list such as

['/www/'
  ['mysite.com'
    ['images'
      ['png'],
      ['jpg']
    ]
  ]
]

I have tried using a recursive function like so: def traverse(dir): FTP.dir(dir, traverse)

FTP.dir returns lines in this format:

drwxr-xr-x    5 leavesc1 leavesc1     4096 Nov 29 20:52 mysite.com

so doing line[56:] will give me just the directory name(mysite.com). I use this in the recursive function.

But i cannot get it to work. I've tried many different approaches and can't get it to work. Lots of FTP errors as well (either can't find the directory - which is a logical issue, and sometimes unexpected errors returned by the server, which leaves no log and i can't debug)

bottom line question: How to get a hierarchical directory listing from an FTP server?

Here is a naive and slow implementation. It is slow because it tries to CWD to each directory entry to determine if it is a directory or a file, but this works. One could optimize it by parsing LIST command output, but this is strongly server-implementation dependent.

import ftplib

def traverse(ftp, depth=0):
    """
    return a recursive listing of an ftp server contents (starting
    from the current directory)

    listing is returned as a recursive dictionary, where each key
    contains a contents of the subdirectory or None if it corresponds
    to a file.

    @param ftp: ftplib.FTP object
    """
    if depth > 10:
        return ['depth > 10']
    level = {}
    for entry in (path for path in ftp.nlst() if path not in ('.', '..')):
        try:
            ftp.cwd(entry)
            level[entry] = traverse(ftp, depth+1)
            ftp.cwd('..')
        except ftplib.error_perm:
            level[entry] = None
    return level

def main():
    ftp = ftplib.FTP("localhost")
    ftp.connect()
    ftp.login()
    ftp.set_pasv(True)

    print traverse(ftp)

if __name__ == '__main__':
    main()