Numpy randn() sampling in a range of 1 and -1

Question

pythonbegin 0 Light Poster

14 Years Ago

Hi All

I have a query about Numpy randn() function to generate random samples from standard normal distribution. I want to add some random samples using this function to my data and I want these samples must be in a range of 1 and -1.

I using this function

y = randn(100,20)
a = X + y

X is a old data matrix(100 rows and 20 columns)with values normalised between 0 and 1
a is a datamatrix with random samples y added to each cell.

but when I do this a.max() is always coming greater than 1. Is there any way to get the y in a range of 1 and -1. or Someway I can specify a range in the function itself.

Thanks in advance!

python

2 Contributors
5 Replies
2K Views
2 Days Discussion Span
Latest Post 14 Years Ago Latest Post by pythonbegin

All 5 Replies

Gribouillis 1,391 Programming Explorer

14 Years Ago

If you only want to add some random noise to your data, you could add a normal random sample with a small standard deviation and clip the result in the interval [-1.0, 1.0] like this

sigma = 0.1
a = numpy.clip(X + sigma * randn(100, 20), -1.0, 1.0)

However, due to the clipping, your array won't exactly be a normal perturbation of the initial array, and this distorsion increases with the value of sigma. Mathematically speaking, it is impossible to require both a normal distribution and bounds on the possible values, but if sigma is small enough, the amount of clipped values will be small and the distorsion can be neglected.

vegaseat commented: goog explanation +14

pythonbegin commented: short and sweet +2

Gribouillis 1,391 Programming Explorer

14 Years Ago

Thanks for the excellent description.
Perfect. Is ther anyway to use lognormal in the same way as randn? to choose sample from log normal distribution.

I suppose so. Write numpy.random.lognormal(mean, sigma, (100,20)) . But can you explain what your matrix is and why you want to add random samples, and also why should the result be in [-1,1] ?

Reply to this topic

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.

pythonbegin 0 Light Poster · Answer 1 · 2011-04-22T11:50:47+00:00

Sorry, in the last line I mean to say- Is there any way to get the 'a' in a range of 1 and -1.

Hi All
I have a query about Numpy randn() function to generate random samples from standard normal distribution. I want to add some random samples using this function to my data and I want these samples must be in a range of 1 and -1.
I using this function
y = randn(100,20)
a = X + y
X is a old data matrix(100 rows and 20 columns)with values normalised between 0 and 1
a is a datamatrix with random samples y added to each cell.
but when I do this a.max() is always coming greater than 1. Is there any way to get the y in a range of 1 and -1. or Someway I can specify a range in the function itself.
Thanks in advance!

pythonbegin 0 Light Poster · Answer 2 · 2011-04-24T10:07:37+00:00

Thanks for the excellent description.

Perfect. Is ther anyway to use lognormal in the same way as randn? to choose sample from log normal distribution.

If you only want to add some random noise to your data, you could add a normal random sample with a small standard deviation and clip the result in the interval [-1.0, 1.0] like this
sigma = 0.1
a = numpy.clip(X + sigma * randn(100, 20), -1.0, 1.0)
However, due to the clipping, your array won't exactly be a normal perturbation of the initial array, and this distorsion increases with the value of sigma. Mathematically speaking, it is impossible to require both a normal distribution and bounds on the possible values, but if sigma is small enough, the amount of clipped values will be small and the distorsion can be neglected.

pythonbegin 0 Light Poster · Answer 3 · 2011-04-24T11:29:56+00:00

Thanks.
Data is a series of output from a program (values between 0 and 1 generated from log normal gaussian distribution). I checked the performance of the model on this data and now i want to add some random noise to it and want to check the performance again to see effect of noise. As original values are in the of 0 to 1, I need to add noise in the range of 0 and 1. Once random samples generated in the range of [-1,1], i can take absolute values for range of [0,1].

Numpy randn() sampling in a range of 1 and -1

Recommended Answers Collapse Answers

All 5 Replies

Recommended Answers