Ming c Li Blog

Search two files with similar content by Perl

Wednesday, July 19, 2006, 01:45 PM

Sometimes I need to compare two similar files and get to know what entry is in file1, what entry is in file2, I have tried read each file and loop through the files line by line and compare the lines with regex, but it turned out to be very dumb idea and I never got it working. After reading more Perl doc and mailling list, apparently, the perl hash variable is the way to define uniqness of a record, here is the perl script I I got it working:

#!/usr/bin/perl
use strict;
use warnings;
use Tie::File;

tie my @array1, 'Tie::File', "/tmp/u1.txt" or die "Could not tie: $!";

tie my @array2, 'Tie::File', "/tmp/u2.txt" or die "Could not tie: $!";

my %index;
foreach my $entry (@array1) {
if (not exists $index{$entry}) {
$index{$entry} = 1;
}
}

foreach my $entry (@array2) {
if (not exists $index{$entry}) {
$index{$entry} = 2;
}
else {
$index{$entry} += 2;
}
}

foreach my $entry (keys %index) {
if ($index{$entry} == 1) {
print "Entry $entry is only in file one\n";
}
elsif ($index{$entry} == 2) {
print "Entry $entry is only in file two\n";
}
elsif ($index{$entry} == 3) {
print "Entry $entry is in both files\n";
}
else {
print "Entry $entry is screwed up!\n";
}
}

[ add comment ] permalink

( 3 / 79 )

A Perl tip to replace strings, symbols..

Wednesday, July 19, 2006, 11:11 AM

perl -pi'.orig' -e 's/oldstring/newstring/g' fileA, fileB...the i argument is backup option, fileA, fileB will be backed up as fileA.orig, fileB.orig. Ever have format problem with '\r','^M', '\n' after transfering files between windows/mac/linux. try perl -pi'.orig' -e 's/\r(\n)?/\n/g' fileA, fileB...

If on unix/linux, try unix2dos command

[ add comment ] permalink

( 3.1 / 70 )

Wonder how to get two way packets from tcpdump?

Tuesday, July 18, 2006, 03:50 PM

Such a simple question, silly me. run tcpdump on both end and specify the src host and dst port or src port.

[ add comment ] permalink

( 3.1 / 82 )

MySQL setup hassle

Tuesday, July 18, 2006, 02:06 PM

While fiddling with mysql setup, the new mysql command client connect to the new mysql server fine, but not old mysql command client, so I have to make symbolic link to the new mysql command client like ln -s /usr/local/mysql/bin/mysql /usr/bin/mysql, same as mysqladmin...

I thought the above symbolic link should solve all the setup problem, but it didn't. When I run the WebCalendar setup and test mysql conncetion from the web setup interface, error "could not connect /var/lib/mysql/mysql.sock..." pops up, damn! why would the WebCalendar look for the old mysql server socket file instead of the new one /tmp/mysql.sock? is this PHP mysql client problem? To work around this issue, I reset the socket file to /var/lib/mysql/mysql.sock in the mysql client and server section of /etc/my.cnf "socket = /var/lib/mysql/mysql.sock" and restarted msyqld. this sovled the socket problem.
I was so happy to get rid of the complain and test the mysql connection again, but guess what, I came across another problem "Client does not support authentication protocol". I googled around the Internet, found the http://dev.mysql.com/doc/refman/5.0/en/old-client.html link. (pre-4.1) mysql clients does not support the new hash algorithm, well, I don't know how the WebCalendar PHP application call the old mysql client and don't want to update all mysql client libraries. I could reset the WebCalendar mysql user password to pre-4.1 style:

mysql> SET PASSWORD FOR
-> 'webcalendar'@'localhost' = OLD_PASSWORD('newpwd');

I can identify these accounts with the following query:

mysql> SELECT Host, User, Password FROM mysql.user
-> WHERE LENGTH(Password) > 16;

when you

mysql> desc mysql.user;

you will see the pos-4.1 mysql use varchar(41) to define the "Password" colum.

After I reset the Password, all works fine

[ add comment ] permalink

( 2.9 / 80 )

Floppy on Fedora Core 3

Thursday, July 13, 2006, 10:55 AM

Today, I tried to dd if=boot.img of=/dev/fd0 bs=1024 conv=sync; sync , but I always got error message "dd: opening '/dev/fd0', read-only file system". then I fdformatted /dev/fd0 and did the dd command again, it works

[ add comment ] permalink

( 3 / 74 )

Back Next

Links

Home
Contact Me
Stats

Calendar

Archives

View Archives

2009
- June
- May
- April
- February
- January
2008
- June
- May
- April
- January
2007
- December
- November
- September
- August
- June
- April
- March
- February
- January
2006
- December
- November
- October
- September
- August
- July
  - Octal and Octet
    07/31/06
    The octal numeral system is the base-8 number system, and uses the digits 0
  - Upgraded Mail::SpamAssassin 3.1.4, Clamav 0.88.3. Installed gnugpg on Mac OS X server
    07/31/06
    The email spam filter hasn't been running for a while on the email ser
  - Upgrade SpamAssassin 3.1.1 to SpamAssassin 3.1.4
    07/27/06
    There is spamassassin rule lint info after upgrading from 3.1.1 to 3.1.4,
  - What is Mail::SpamAssassin::DnsResolver for?
    07/27/06
    From the perldoc DESCRIPTION section of DnsResolver.pm, it says :" Thi
  - Perl Module installation from CPAN
    07/26/06
    Today, when I am testing the latest SpamAssassin 3.1.4 release, I get a not
  - ClamAV upgrade on Fedora
    07/25/06
    Upgrade ClamAV is plain easy on Fedora, backup ClamAV config file and downl
  - Logrotate on Debian
    07/25/06
    I have been running apache webserver on an lower end old PC (PII450/192RAM/
  - PHP Fatal error: Call to undefined function: mysql_pconnect()
    07/24/06
    Today, while I am trying to access the webcalendar from browser, nothing sh
  - Search two files with similar content by Perl
    07/19/06
    Sometimes I need to compare two similar files and get to know what entry
  - A Perl tip to replace strings, symbols..
    07/19/06
    perl -pi'.orig' -e 's/oldstring/newstring/g' fileA, fil
  - Wonder how to get two way packets from tcpdump?
    07/18/06
    Such a simple question, silly me. run tcpdump on both end and specify the s
  - MySQL setup hassle
    07/18/06
    While fiddling with mysql setup, the new mysql command client connect to th
  - Floppy on Fedora Core 3
    07/13/06
    Today, I tried to dd if=boot.img of=/dev/fd0 bs=1024 conv=sync; sync , but
  - Apple ical does not support ssl over http
    07/12/06
    While testing WebDav and iCal setup, it seems iCal does not know what http
  - Back to work from vacation
    07/10/06
    I went to Victoria and Nanaimo with my wife and couple of friends during Ca
- June

Counter Totals

Total: 199
Today: 199
Yesterday: 0

Most Recent Entries

Return page_count(page) - !!page_has_private(page) == 2 discussion

kernel virtual address to size caculation chat log

Learning Linux Kernel series 1 - Buffer Head

Asterisk app_mp3 patch to play m3u playlist file

Linux file system ext4 debate

Use vim and cscope to read large C source code project

Asterisk App DISA usage and programing control

Tip to turn on vim syntax highlighting on os x 10.5

mp3 to audio cd converter on Linux

Linux filesystem events API