LISTSERV mailing list manager LISTSERV 16.0

Help for LINUX-L Archives


LINUX-L Archives

LINUX-L Archives


LINUX-L@LISTS.UFL.EDU


View:

Message:

[

First

|

Previous

|

Next

|

Last

]

By Topic:

[

First

|

Previous

|

Next

|

Last

]

By Author:

[

First

|

Previous

|

Next

|

Last

]

Font:

Proportional Font

LISTSERV Archives

LISTSERV Archives

LINUX-L Home

LINUX-L Home

LINUX-L  2007

LINUX-L 2007

Subject:

Re: file parsing help

From:

Jim Martinez <[log in to unmask]>

Reply-To:

Platform Independent Linux List! <[log in to unmask]>

Date:

Sat, 17 Feb 2007 19:44:55 -0500

Content-Type:

TEXT/PLAIN

Parts/Attachments:

Parts/Attachments

TEXT/PLAIN (47 lines)

On Feb 17 Allen S. Rout wrote:

> >> On Sat, 17 Feb 2007 17:43:20 -0500, Dan Trevino <[log in to unmask]> said:
> 
> 
> > I need to parse a tab delimited text file of several thousand lines.  The
> > first part is easy;
> 
> > cut -f8 file
> 
> > field 8 of this file contains multiple, variable length, sentences enclosed
> > in double quotes.  Example returned by the cut command above:
> 
> > "this is sentence one. this i sentence two.  this ""is a quote that may be""
> > in sentence three."
> 
> > I need to grab the first sentence for further processing (without the
> > period, without the beginning quote mark) into a variable, but am having
> > difficulty. Can anyone suggest an easy way to do this?  I'm open to
> > bash,perl,python solutions, but prefer bash.
> 

So for the example above what you want is:

this is sentence one

Is that right?

Will there always be a quote to start the line?

> 
> $foo = [ 
> '"this is sentence one. this i sentence two.  this ""is a quote that may be""in sentence three."',
> '"this is lalala. "'
> 	 ];
> 
> print  join ("\n", map { /\"([^.]+)\./; $1 } @$foo ) ;
> 

Putting it together with a slight twist to Allen's solution:

cut -f8 file | perl -lne 's/^"([^.]+)\./$1/ or die "no match:$_...";print 
$1'

hope that helps

Jim

Top of Message | Previous Page | Permalink

Advanced Options


Options

Log In

Log In

Get Password

Get Password


Search Archives

Search Archives


Subscribe or Unsubscribe

Subscribe or Unsubscribe


Archives

2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997

ATOM RSS1 RSS2



LISTS.UFL.EDU

CataList Email List Search Powered by the LISTSERV Email List Manager