1. Learn
  2. /
  3. Courses
  4. /
  5. Statistical Thinking in Python (Part 2)

Connected

Exercise

Generating a permutation sample

In the video, you learned that permutation sampling is a great way to simulate the hypothesis that two variables have identical probability distributions. This is often a hypothesis you want to test, so in this exercise, we will write a function to generate a permutation sample from two data sets.

Remember, a permutation sample of two arrays having respectively n1 and n2 entries is constructed by concatenating the arrays together, scrambling the contents of the concatenated array, and then taking the first n1 entries as the permutation sample of the first array and the last n2 entries as the permutation sample of the second array.

Instructions

100 XP
  • Define a function with this signature: permutation_sample(data_1, data_2).
    • Concatenate the two input arrays into one using np.concatenate(). Be sure to pass in data_1 and data_2 as one argument `(data1, data2).
    • Use np.random.permutation() to permute the concatenated array.
    • Store the first len(data_1) entries as perm_sample_1 and the last len(data_2) entries as perm_sample_2.
    • Return perm_sample_1 and perm_sample_2.