Dear devs,
2 small unix related questions.
does anyone know about:
- a quick and easy way to remove duplicate lines from a text files?
- a quick and easy way to decide if two image (binary) files are the
same
image/picture?
thanks in advance for your help!
-g.
Hi,
- a quick and easy way to remove duplicate lines from a text files?
cat file | sort | uniq > file
should work (if the sorting doesn’t matter in the file)
cat file | uniq > file
works if the duplicate lines are next to each other
- a quick and easy way to decide if two image (binary) files are the same
image/picture?
findimagedupes << search for that in google, I’m using one of those
perl scripts.
thanks in advance for your help!
Hope that helps,
Jo
On Feb 4, 2008 5:09 PM, George M.
[email protected] wrote:
Dear devs,
2 small unix related questions.
does anyone know about:
- a quick and easy way to remove duplicate lines from a text files?
uniq < filename (assuming that the lines are consequtive; otherwise,
you need to do a sort first, or keep the lines in a hash in either
Ruby or Perl)
- a quick and easy way to decide if two image (binary) files are the same
image/picture?
Do you mean if they are identical? cmp should do it. Another
approach is to get a checksum of the files (e.g. using “openssl rmd160
”) and see if the checksum is the same.
Eivind.
Thanks for the help everyone!
On Mon, 2008-02-04 at 18:09 +0200, George M. wrote:
Dear devs,
2 small unix related questions.
does anyone know about:
delete duplicate, nonconsecutive lines from a file. Beware not to
overflow the buffer size of the hold space, or else use GNU sed.
sed -n ‘G; s/\n/&&/; /^([ -~]\n).\n\1/d; s/\n//; h; P’
- a quick and easy way to decide if two image (binary) files are the
same image/picture?
diff will tell you if the files are different also
rthompso@raker ~ $ diff 10000_Galaxies,_HST_Ultra_Deep.png
Deathvalleysky_nps_big.png
Files 10000_Galaxies,_HST_Ultra_Deep.png and Deathvalleysky_nps_big.png
differ
rthompso@raker ~ $ cp 10000_Galaxies,_HST_Ultra_Deep.png junk.png
rthompso@raker ~ $ diff 10000_Galaxies,_HST_Ultra_Deep.png junk.png