将XML集合(Pivotal Tracker故事)转换为Ruby哈希/对象 [英] Convert XML collection (of Pivotal Tracker stories) to Ruby hash/object
问题描述
我收集了一些XML格式的故事。我想分析这个文件并将每个故事作为散列或者Ruby对象返回,这样我就可以在一个Ruby脚本中进一步操纵数据。
I have a collection of stories in an XML format. I would like to parse the file and return each story as either hash or Ruby object, so that I can further manipulate the data within a Ruby script.
是否 Nokogiri 支持这个,还是有更好的工具/库可以使用?
Does Nokogiri support this, or is there a better tool/library to use?
XML文档具有以下结构,通过 Pivotal Tracker的web API 返回:
The XML document has the following structure, returned via Pivotal Tracker's web API:
<?xml version="1.0" encoding="UTF-8"?>
<stories type="array" count="145" total="145">
<story>
<id type="integer">16376</id>
<story_type>feature</story_type>
<url>http://www.pivotaltracker.com/story/show/16376</url>
<estimate type="integer">2</estimate>
<current_state>accepted</current_state>
<description>A description</description>
<name>Receivable index listing will allow selection viewing</name>
<requested_by>Tony Superman</requested_by>
<owned_by>Tony Superman</owned_by>
<created_at type="datetime">2009/11/04 15:49:43 WST</created_at>
<accepted_at type="datetime">2009/11/10 11:06:16 WST</accepted_at>
<labels>index ui,receivables</labels>
</story>
<story>
<id type="integer">17427</id>
<story_type>feature</story_type>
<url>http://www.pivotaltracker.com/story/show/17427</url>
<estimate type="integer">3</estimate>
<current_state>unscheduled</current_state>
<description></description>
<name>Validations in wizards based on direction</name>
<requested_by>Matthew McBoggle</requested_by>
<created_at type="datetime">2009/11/17 15:52:06 WST</created_at>
</story>
<story>
<id type="integer">17426</id>
<story_type>feature</story_type>
<url>http://www.pivotaltracker.com/story/show/17426</url>
<estimate type="integer">2</estimate>
<current_state>unscheduled</current_state>
<description>Manual payment needs a description field.</description>
<name>Add description to manual payment</name>
<requested_by>Tony Superman</requested_by>
<created_at type="datetime">2009/11/17 15:10:41 WST</created_at>
<labels>payment process</labels>
</story>
<story>
<id type="integer">17636</id>
<story_type>feature</story_type>
<url>http://www.pivotaltracker.com/story/show/17636</url>
<estimate type="integer">3</estimate>
<current_state>unscheduled</current_state>
<description>The SMS and email templates needs to be editable by merchants.</description>
<name>Notifications are editable by the merchant</name>
<requested_by>Matthew McBoggle</requested_by>
<created_at type="datetime">2009/11/19 16:44:08 WST</created_at>
</story>
</stories>
推荐答案
您可以利用ActiveSupport中的哈希扩展。然后,您只需要在Nokogiri中解析文档,然后将节点集结果转换为散列。此方法将保留属性类型(例如整数,日期,数组)。 (当然,如果你使用Rails,你不需要/包含主动支持或nokogiri,如果你在你的环境中有这个支持,我假设在这里有一个纯Ruby实现。)
You can leverage the Hash extensions in ActiveSupport. Then you just need to parse your document in Nokogiri and then convert the nodeset result into a hash. This method will preserve attribute typing (eg integers, dates, arrays). (Of course if you're using Rails you don't have to require/include active support or nokogiri if you have it in your environment. I'm assuming a pure Ruby implementation here.)
require 'rubygems'
require 'nokogiri'
require 'activesupport'
include ActiveSupport::CoreExtensions::Hash
doc = Nokogiri::XML.parse(File.read('yourdoc.xml'))
my_hash = doc.search('//story').map{ |e| Hash.from_xml(e.to_xml)['story'] }
这会产生一个散列数组(每个故事节点一个),并保留基于属性的打字,如下所示:
This will produce an array of hashes (one for each story node), and preserve the typing based on the attributes, as demonstrated below:
my_hash.first['name']
=> "Receivable index listing will allow selection viewing"
my_hash.first['id']
=> 16376
my_hash.first['id'].class
=> Fixnum
my_hash.first['created_at'].class
=> Time
这篇关于将XML集合(Pivotal Tracker故事)转换为Ruby哈希/对象的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!