Estimation for Pi is precise but inaccurate

Question

My code reliably finds a precise (similar value every time) but in the region of 3.24 and 3.26. Competent mathematicians will note this around 0.1 away from an ideal Pi value.

My code revolves around picking random points on a 10,000 x 10,000 grid. The hypotenuse with respect to (0,0) is calculated and if this hypotenuse is less than 10,000, the point is inside a theoretical circle with radius 10,000.

The ratio of points inside and all points multiplied by 4 should give an estimation for Pi.

#include <stdio.h>
#include <stdlib.h>
#include <math.h>
#include <time.h>

int RandomNum(int min, int max);

int main()
{
    int d = 0;
    int n = 0;
    srand(time(NULL));


    for (int i =0; i < 10000; i++)
    {
        n++;
        int xI = RandomNum(-10000, 10000);
        //double xF = (double) xI/10000;
        printf("
x: %f", xI);

        int yI = RandomNum(-10000, 10000);
        //double yF = (double) yI/10000;
        printf("
y: %f", yI);

        double pythag = sqrt(xI*xI+yI*yI);
        printf("
hyp: %f

",pythag);

        if (pythag < 10000)
        {
            d++;
        }

    }
    printf("d = %d
", d);
    printf("n = %d
", n);
    double DNratio = (double) d/n;
    double PiEstimate = DNratio * 4;
    printf("Pi Estimate: %f", PiEstimate);
    return 0;
}
int RandomNum(int min, int max)
{
  int r = rand()%(max - min + 1) + min;
  return r;
}

//https://www.geeksforgeeks.org/c/generating-random-number-range-c/
//https://www.tutorialspoint.com/c_standard_library/c_function_srand.htm

Robby Cornelissen · Accepted Answer

The behavior you're seeing is most likely the result of a modulo bias, due to the small RAND_MAX value that is being used by the specific C library/compiler's implementation of the rand() function.

I get similar results to yours when testing with MSVC (x86 msvc v19.latest), where the RAND_MAX value is just 32,767. Using a modulo 10,000 operation (% 10000) on the output of rand() will generate "random" numbers with a very significant bias towards the lower numbers.⁽¹⁾ This causes more points to fall within your circle than you would normally expect, and results in an inflated estimated value of π.⁽²⁾

To demonstrate this: if we pick the random values between 0 and 32,767 and compare the hypotenuse with that value, we eliminate the modulo bias altogether, and the results much more closely approach the value of π as can be seen here:⁽³⁾

#include <stdio.h>
#include <stdlib.h>
#include <math.h>
#include <time.h>

int main()
{
    int d = 0;
    int n = 0;
    srand(time(NULL));

    for (int i =0; i < 10000; i++)
    {
        n++;
        int xI = rand();
        int yI = rand();

        double pythag = sqrt(xI*xI + yI*yI);

        if (pythag <= RAND_MAX)
        {
            d++;
        }
    }
    
    printf("d = %d
", d);
    printf("n = %d
", n);
    double DNratio = ((double) d)/n;
    double PiEstimate = DNratio * 4;
    printf("Pi Estimate: %f", PiEstimate); // yields ≈3.14
    return 0;
}

^{⁽¹⁾ Your implementation actually seems to be doing modulo 20,001, resulting in an even bigger bias. For a RAND_MAX value of 32,767, the rand() % 20001 operation produces numbers in the interval [0 .. 12,766] twice as often as numbers in the interval [12,767 .. 20,000].

⁽²⁾ When using a compiler with a larger RAND_MAX value (like x86_64 gcc (trunk) where it's 2,147,483,647), the bias is much less noticeable, but still there.

⁽³⁾ This of course doesn't negate the effects of the relatively small random sample size, and the inherent bias of the PRNG itself.}

Luke Sharkey · Answer

We can remove the biases introduced by the random function by simply iterating from -R to R in both x and y directions.

I've also changed sqrt(x * x + y * y) < R to x * x + y * y < R * R which allows us to avoid a floating point operations and a call to sqrt . To be even more accurate, we can use (R+1) * R to expand the circle by about half a unit to get better results on edge cases.

#include <stdio.h>

#define R   10000

int main()
{
    int d = 0;
    int n = (2*R+1) * (2*R+1);

    for (int x = -R; x <= R; x++)
        for (int y = -R; y <= R; y++)
            if (x * x + y * y < (R+1) * R)
                d++;

    printf("d = %d
", d);
    printf("n = %d
", n);
    double DNratio = (double) d/n;
    double PiEstimate = DNratio * 4;
    printf("Pi Estimate: %f", PiEstimate);
    return 0;
}

This example gives:

d = 314190797
n = 400040001
Pi Estimate: 3.141594

The intuition behind growing the circle by half a unit is:

Each point really represents the 1x1 area centred on that point, so by only comparing against the centres of these 1x1 squares, we are counting a shape about half a unit smaller than the actual area of the circle. To compensate, we grow the circle by half a unit to match the boundaries of our squares. (And R(R+1) roughly equals (R+0.5)^2 if that wasn't obvious!)

Estimation for Pi is precise but inaccurate

Tags:

c

math

pi

montecarlo

Sam Wray-Twell

2 Answers

Robby Cornelissen

Luke Sharkey

Recent Activity

Donate For Us

Estimation for Pi is precise but inaccurate

Tags:

c

math

pi

montecarlo

Sam Wray-Twell

2 Answers

Robby Cornelissen

Luke Sharkey

Related questions

Recent Activity

Donate For Us