Unix Notes
- byte swapping
- dd if=<infile> of=<outfile> conv=swab
- text prepping - choosing specific columns (here 2 and 5) from a tab-delimited file
- cut -f 2,5 <inputfile.txt>
- text prepping - taking text and leaving only one word per line
- cat <inputfile.txt> | tr -cs '[:alnum:]' '\n' > <outputfile.txt>
- text prepping - counting instances of each word
- sort <inputfile.txt> | uniq -c | sort -nr > <outputfile.txt>
- text prepping - creating bigrams
- tail -n +2 <onewordperline.txt> > <outputskippingfirstword.txt>
- paste <onewordperline.txt> <outputskippingfirstword.txt>
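
Worked example: a minimal sketch that runs the commands above on a tiny made-up input in a scratch directory. All file names (bytes.bin, swapped.bin, table.tsv, sample.txt, words.txt, counts.txt, shifted.txt, bigrams.txt) and the sample text are assumptions for illustration only, not part of the original notes.

```shell
#!/bin/sh
# Sketch only: every file name and sample string below is hypothetical.
set -eu
workdir=$(mktemp -d)
cd "$workdir"

# byte swapping: conv=swab swaps each pair of input bytes
printf 'abcd' > bytes.bin
dd if=bytes.bin of=swapped.bin conv=swab 2>/dev/null
cat swapped.bin; echo            # prints "badc"

# choosing columns 2 and 5 from a tab-delimited file
printf 'a\tb\tc\td\te\n' > table.tsv
cut -f 2,5 table.tsv             # prints "b<TAB>e"

# one word per line: -c complements the set, -s squeezes repeated newlines
printf 'the cat saw the dog\nthe dog ran\n' > sample.txt
tr -cs '[:alnum:]' '\n' < sample.txt > words.txt

# counting instances of each word, most frequent first
sort words.txt | uniq -c | sort -nr > counts.txt

# bigrams: pair each word with the word that follows it
tail -n +2 words.txt > shifted.txt
paste words.txt shifted.txt > bigrams.txt
cat counts.txt bigrams.txt
```

The last line of bigrams.txt holds the final word followed by an empty second column, since the shifted file is one line shorter. Drop it if only complete pairs are wanted.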