I'm new in awk scripting and would like to have some help in calculating 95th percentile value for a file that consist of this data: <pre class="prettyprint"><code>0.0001357 0.000112 0.000062 0.000054 0.000127 0.000114 0.000136 </code></pre> I tried: <pre class="prettyprint"><code>cat filename.txt | sort -n | awk 'BEGIN{c=0} {total[c]=$1; c++;} END{print total[int(NR*0.95-0.5)]}' </code></pre> but I dont seem to get the correct value when I compare it to excel.

I am not sure if Excel does some kind of weighted percentile, but if you actually want one of the numbers that was in your original set, then your method should work correctly for rounding. You can simplify a little bit like this, but it's the same thing. <pre class="prettyprint"><code>sort -n input.txt | awk '{all[NR] = $0} END{print all[int(NR*0.95 - 0.5)]}' </code></pre>

Calculating 95th percentile with awk

Tags:

awk

I'm new in awk scripting and would like to have some help in calculating 95th percentile value for a file that consist of this data:

I tried:

cat filename.txt | sort -n |
awk 'BEGIN{c=0} {total[c]=$1; c++;} END{print total[int(NR*0.95-0.5)]}'

but I dont seem to get the correct value when I compare it to excel.

794

asked Jul 11 '14 22:07

user3831155

1 Answers

I am not sure if Excel does some kind of weighted percentile, but if you actually want one of the numbers that was in your original set, then your method should work correctly for rounding.

You can simplify a little bit like this, but it's the same thing.

sort -n input.txt  | awk '{all[NR] = $0} END{print all[int(NR*0.95 - 0.5)]}'

124

answered Oct 04 '22 00:10

merlin2011

Related questions
                            
                                Delete Lines : after pattern1 and between pattern2 and pattern3 using awk/sed/perl
                            
                                combine multiple awk commands
                            
                                Sed/Awk - remove blankspaces / join lines in ldif dump
                            
                                Join two files using awk
                            
                                Separating output records in AWK without a trailing separator
                            
                                Remove first columns then leave remaining line untouched in awk
                            
                                Why does AWK not treat this array index as a number unless I use int()?
                            
                                AWK sub function syntax
                            
                                How to select only the first 10 rows in my AWK script
                            
                                Fix Mismatch Between Data And Local In Awk Command
                            
                                Join lines at pattern. Uneven interval
                            
                                Adding a character after a digit and dot in bash
                            
                                Average of column by hours (rows) using awk
                            
                                Replacing two strings using awk
                            
                                Using awk with variables
                            
                                awk Joining n fields with delimiter
                            
                                Trim first 9 letters using awk, sed
                            
                                Efficient way to count the amount lines obeying some condition
                            
                                Sum number in two different files
                            
                                Count number of column in a pipe delimited file

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With