Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in text-processing

What is the fastest, most error-free method of extracting and cleaning the HTML body text in Python?

Filtering tokens by frequency using filter_extremes in Gensim

Reuse matched pattern in inserted or appended line with GNU sed

Shell/Bash parsing text file

How to find similar products

How can I detect a sequence of "hollows" (holes, lines not matching a pattern) bigger than n in a text file?

linux shell text-processing

Prepending letter to field value

How to retrieve text from the current line at specified cursor position before and after up to specified boundary characters?

insert a character at x,y,z positions in a string

How to get all the words around a word within a fixed proximity

Escape percent sign (%) in hexdump format

text-processing hexdump

How to delete multiple empty lines with SED?

sed text-processing

Reading email content

c# winforms text-processing

Replace end of line with comma and put parenthesis in sed/awk

awk sed text-processing

Why are sequential regular expressions more efficient than a combined experession?

Find unique lines between two files

unique text-processing comm

term clustering library?

Keyword extraction from short dutch texts