How to download a whole folder of files/subfolders from the web in PowerShell
Question
I can download a single file from the web using:
$wc = New-Object System.Net.WebClient
$wc.DownloadFile("http://blah/root/somefile.ext", "C:\Downloads\www\blah\root\somefile.ext")
But how do I download all the files, including subfolders? Something like the following would be nice...
$wc.DownloadFile("http://blah/root/", "C:\Downloads\www\blah\root\")
The root folder itself appears as a directory listing in IE, you know, like:
[To Parent Directory]
01 July 2012 09:00 1234 somefile.ext
01 July 2012 09:01 1234 someotherfile.ext
As a bonus, how would I download just the files in the root folder, ignoring subfolders?
Recommended answer
Here's what I came up with based on Andy's suggestion (with plenty of help from Google, of course):
####################################################################################################
# This function copies a folder (and optionally, its subfolders)
#
# When copying subfolders it calls itself recursively
#
# Requires WebClient object $webClient defined, e.g. $webClient = New-Object System.Net.WebClient
#
# Parameters:
# $source - The url of folder to copy, with trailing /, e.g. http://website/folder/structure/
# $destination - The folder to copy $source to, with trailing \ e.g. D:\CopyOfStructure\
# $recursive - True if subfolders of $source are also to be copied or False to ignore subfolders
# Return - None
####################################################################################################
Function Copy-Folder([string]$source, [string]$destination, [bool]$recursive) {
    # Create the destination folder if needed; discard New-Item's output so the function returns nothing
    if (!(Test-Path $destination)) {
        New-Item $destination -ItemType Directory -Force | Out-Null
    }

    # Get the file list from the web page
    $webString = $webClient.DownloadString($source)
    $lines = [Regex]::Split($webString, "<br>")

    # Parse each line, looking for files and folders
    foreach ($line in $lines) {
        if ($line.ToUpper().Contains("HREF")) {
            # File or Folder
            if (!$line.ToUpper().Contains("[TO PARENT DIRECTORY]")) {
                # Not Parent Folder entry: take the link text that follows the quoted HREF value
                $items = [Regex]::Split($line, """")
                $items = [Regex]::Split($items[2], "(>|<)")
                $item = $items[2]

                # The raw HTML normally carries the folder marker as &lt;dir&gt;; the literal <dir> is also accepted
                $lower = $line.ToLower()
                if ($lower.Contains("&lt;dir&gt;") -or $lower.Contains("<dir>")) {
                    # Folder: only descend when a recursive copy was requested
                    if ($recursive) {
                        Copy-Folder "$source$item/" "$destination$item\" $recursive
                    }
                } else {
                    # File
                    $webClient.DownloadFile("$source$item", "$destination$item")
                }
            }
        }
    }
}
No guarantees of course, but it worked for the site I was interested in.
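Because the parsing depends on the exact HTML the server emits, it may help to check it against a saved copy of the listing before downloading anything. The following is only a sketch: `Get-ListingEntry` is a hypothetical helper that is not part of the answer above, and the sample HTML is an assumption about what an IIS-style listing looks like, so it should be replaced with the real page source.

```powershell
# Hypothetical helper (not part of the original answer): parses a directory
# listing and returns one object per entry, without downloading anything.
function Get-ListingEntry {
    param([string]$Html)

    # Each entry in this listing style is terminated by <br>
    foreach ($line in ($Html -split '<br>')) {
        # Skip lines without a link, and the link back to the parent folder
        if ($line -notmatch 'href') { continue }
        if ($line -match '\[To Parent Directory\]') { continue }

        # Capture the link text between <A HREF="..."> and </A>
        if ($line -match '<a\s+href="[^"]*">([^<]+)</a>') {
            $name = $Matches[1]
            [pscustomobject]@{
                Name     = $name
                # Folders are flagged with <dir>, usually HTML-encoded as &lt;dir&gt;
                IsFolder = ($line -match '&lt;dir&gt;|<dir>')
            }
        }
    }
}

# Sample listing: an assumption about the server's output, not taken from the question
$sample = '<pre><A HREF="/">[To Parent Directory]</A><br><br>' +
          ' 7/1/2012  9:00 AM  1234 <A HREF="/root/somefile.ext">somefile.ext</A><br>' +
          ' 7/1/2012  9:01 AM  &lt;dir&gt; <A HREF="/root/sub/">sub</A><br></pre>'

$entries = @(Get-ListingEntry -Html $sample)
$entries | Format-Table Name, IsFolder
```

If the names or the folder flags come out wrong for the real page, the patterns need adjusting for that server's format. Once they look right, calling Copy-Folder with $false as the third argument downloads only the files in the root folder, and $true includes the subfolders.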