从具有多行记录的文件中提取数据,将每个范围保存到单独的文件中 [英] Extracting data from a file with multi-line records,saving each range to a separate file

查看:49
本文介绍了从具有多行记录的文件中提取数据,将每个范围保存到单独的文件中的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我有一个看起来像这样的文件

I have a file that looks like this

     *TRANSACTION STARTED*
[020t CARD INSERTED
[020tCARD: *************5845
DATE 01-02-16    TIME 05:45:52
 05:46:26 GENAC 1 : ARQC
EXTERNAL AUTHENTICATE: NO ARPC
 05:46:30 GENAC 2 : AAC
 01 FEB 2016     05:47:41      10160021

     WITHDRAW
     FROM XXXXXXXX    ?
INVALID TRANSCATION ON TERMINAL.
-----------------------
[020t 05:47:05 CARD TAKEN
[020t 05:47:07 TRANSACTION END
[0r(1)2[000p[040qe1w3h162[020t*742*02/01/2016*05:47*
     *TRANSACTION STARTED*
[020t CARD INSERTED
[020tCARD: *************2584
DATE 01-02-16    TIME 05:47:27
 05:48:00 GENAC 1 : ARQC
 05:48:05 GENAC 2 : TC
[020t 05:48:16 CARD TAKEN
[020t 05:48:22 NOTES PRESENTED 0,0,2,0
 01 FEB 2016     05:48:52      10160021

     WITHDRAW
     FROM XXXXXXXX    ?
AMT   GHC40.00
[020t 05:48:31 TRANSACTION END
[0r(1)2[000p[040qe1w3h162[020t*743*02/01/2016*05:57*
     *TRANSACTION STARTED*
[020t CARD INSERTED
[020tCARD: *************3862
DATE 01-02-16    TIME 05:57:28
 01 FEB 2016     05:58:33      10160021

     INQUIRY
     FROM XXXXXXXX90018
AVAIL          GHC1260.20  
LEDGER         GHC1260.20  
[020t 05:58:06 CARD TAKEN
[020t 05:58:11 TRANSACTION END
[0r(1)2[000p[040qe1w3h162[020t*744*02/01/2016*06:43*
     *TRANSACTION STARTED*
[020t CARD INSERTED
[020tCARD: *************1972
DATE 01-02-16    TIME 06:43:53
 01 FEB 2016     06:44:56      10160021
5029110111271972
4490    4490
     INQUIRY
     FROM XXXXXXXX23013
AVAIL          GHC14.28
LEDGER         GHC14.28
[020t 06:44:25 CARD TAKEN
[020t 06:44:29 TRANSACTION END
[0r(1)2[000p[040qe1w3h162[020t*745*02/01/2016*06:56*

并且需要提取 *TRANSACTION STARTED*TRANSACTION END 之间的内容,忽略所有其他信息,并为每个范围创建一个新文件.
新文件将只包含

and need to extract what is between *TRANSACTION STARTED* and TRANSACTION END, ignoring all other information, and create a new file for each range.
The new file will contain only

    [020t CARD INSERTED
    [020tCARD: *************2584
    DATE 01-02-16    TIME 05:47:27
     05:48:00 GENAC 1 : ARQC
     05:48:05 GENAC 2 : TC
    [020t 05:48:16 CARD TAKEN
    [020t 05:48:22 NOTES PRESENTED 0,0,2,0
     01 FEB 2016     05:48:52      10160021

         WITHDRAW
         FROM XXXXXXXX    ?
    AMT   GHC40.00
    [020t 05:48:31

我对以前的做了一些修改,但我收到了这个错误:

I have done some modification on the previous but i get this Error:

Can't use string ("/home/lord-ivan/Soures_Code/Perl"...) as a SCALAR ref while "strict refs" in use at readfile.pl line 31, <$fh> line 1.

我的新代码

#! /usr/bin/perl/ 

use warnings;
use strict;

#file to open
my $somefile = "/home/lord-ivan/Soures_Code/Perl/projects/Data/EJDATA.LOG";

open (my $fh, '<:encoding(UTF-8)', $somefile) or die "Could not open file '$somefile' $!"; 
print "$somefile open";

#Extract ranges of lines from a file

while (<$fh>) {

    if (/TRANSACTION STARTED/ .. /TRANSACTION END/) 
    {
         next if /TRANSACTION\s*(STARTED|END)/;
        print $_;   
    }
}


my $ofh;

my $outputfile = "/home/lord-ivan/Soures_Code/Perl/projects/EJ Transport/Queue/.txt";

BEGIN {$outputfile= "EJ"};

open ($fh, ">>${$outputfile}.txt", print $fh $_);

close($outputfile);
$outputfile++;

为什么不能用.

推荐答案

@Iord-Ivan努力学习

什么是数据类型.?

文件处理

什么是处理程序.

您的 $somefile 仅存储文件名.然后使用 $somefile 迭代 while 循环.但是您的 $somefile 仅包含文件名,不包含文件内容.

Your $somefile store only the file name. Then you iterate the while loop using $somefile. But your $somefile contain only the file name not contain the file content.

您的文件内容存储在 $fh 处理程序中,因此为 $fh 处理程序迭代 while 循环.然后逐行检查.

You file content is stored into the $fh handler so iterate the while loop for $fh handler. Then check it line by line.

use warnings;
use strict;
my $somefile = "input.txt";
open (my $fh, '<:encoding(UTF-8)', $somefile) or die "Could not open file '$somefile' $!"; 
print "$somefile open";
while (<$fh>) {

    if (/TRANSACTION STARTED/ .. /TRANSACTION END/) 
    {
         next if /TRANSACTION\s*(STARTED|END)/;
        print $_;   
    }
}

学习perl后,完成平衡程序(FILE WRITE).:)

After studying perl, finish the balance program(FILE WRITE). :)

这篇关于从具有多行记录的文件中提取数据,将每个范围保存到单独的文件中的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆