如何在PowerShell中从Web下载文件/子文件夹的整个文件夹 [英] How to download a whole folder of files/subfolders from the web in PowerShell

查看:603
本文介绍了如何在PowerShell中从Web下载文件/子文件夹的整个文件夹的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我可以使用以下方式从Web下载单个文件:

I can download a single file from the web using:

$wc = New-Object System.Net.WebClient
$wc.DownloadFile("http://blah/root/somefile.ext", "C:\Downloads\www\blah\root\somefile.ext")

但是如何下载所有文件,包括子文件夹?以下的东西将会很好...

But how do I download all the files, including subfolders? Something like the following would be nice...

$wc.DownloadFile("http://blah/root/", "C:\Downloads\www\blah\root\")

根文件夹本身在IE中显示为目录列表,您知道,如:

The root folder itself appears as a directory listing in IE, you know, like:

[To Parent Directory]
                01 July 2012    09:00       1234 somefile.ext
                01 July 2012    09:01       1234 someotherfile.ext

作为一个奖励,我将如何下载根文件夹中的文件,忽略子文件夹?

As a bonus, how would I just downloading the files in the root folder, ignoring subfolders?

推荐答案

基于Andy的建议(当然有很多来自Google的帮助):

Here's what I came up with based on Andy's suggestion (with plenty of help from Google, of course):

####################################################################################################
# This function copies a folder (and optionally, its subfolders)
#
# When copying subfolders it calls itself recursively
#
# Requires WebClient object $webClient defined, e.g. $webClient = New-Object System.Net.WebClient
#
# Parameters:
#   $source      - The url of folder to copy, with trailing /, e.g. http://website/folder/structure/
#   $destination - The folder to copy $source to, with trailing \ e.g. D:\CopyOfStructure\
#   $recursive   - True if subfolders of $source are also to be copied or False to ignore subfolders
#   Return       - None
####################################################################################################
Function Copy-Folder([string]$source, [string]$destination, [bool]$recursive) {
    if (!$(Test-Path($destination))) {
        New-Item $destination -type directory -Force
    }

    # Get the file list from the web page
    $webString = $webClient.DownloadString($source)
    $lines = [Regex]::Split($webString, "<br>")
    # Parse each line, looking for files and folders
    foreach ($line in $lines) {
        if ($line.ToUpper().Contains("HREF")) {
            # File or Folder
            if (!$line.ToUpper().Contains("[TO PARENT DIRECTORY]")) {
                # Not Parent Folder entry
                $items =[Regex]::Split($line, """")
                $items = [Regex]::Split($items[2], "(>|<)")
                $item = $items[2]
                if ($line.ToLower().Contains("&lt;dir&gt")) {
                    # Folder
                    if ($recursive) {
                        # Subfolder copy required
                        Copy-Folder "$source$item/" "$destination$item/" $recursive
                    } else {
                        # Subfolder copy not required
                    }
                } else {
                    # File
                    $webClient.DownloadFile("$source$item", "$destination$item")
                }
            }
        }
    }
}

当然没有保证,但它对我感兴趣的网站有效

No guarantees of course, but it worked for the site I was interested in

这篇关于如何在PowerShell中从Web下载文件/子文件夹的整个文件夹的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆