`spkit.data`.create_dataset

spkit.data.create_dataset(N=100, Dtype='GAUSSIANS', noise=0, use_preset=False, return_para=False, **kwargs)

Sample a 2D dataset from different distributions

Create a 2D dataset for two-class classification, sampled from one of several distributions.

Parameters:

N: int, default=100

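Dtype: str, default='GAUSSIANS'
- one of 'GAUSSIANS', 'MOONS', 'LINEAR', 'SINUSOIDAL', 'SPIRAL'; the parameters specific to each type are listed under "Other parameters".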

noise: scalar in [0, 1], default=0

use_preset: bool, default=False

return_para: bool, default=False

Other parameters: **kwargs

warn: bool, default=True

1. ‘GAUSSIANS’ parameters: :func:`gaussian`

ndist: scalar, default=3
- number of Gaussians for each class.
means: array, shape (2*ndist, 2), default='random'
- array of shape (2*ndist, 2) holding the mean of each Gaussian.
sigmas: array, default='random'
- a sequence of covariance matrices of size (2*ndist, 2).

2. ‘MOONS’ parameters: :func:`moons`

s: scalar, default=0.1
- standard deviation of the Gaussian noise.
d: scalar, str, default='random'
- 1x2 translation vector between the two classes.
- with d = 0 the classes are placed on a circle.
angle: scalar, default='random'
- rotation angle of the moons, in radians.
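
To make the roles of `s`, `d` and `angle` concrete, here is a minimal standard-library sketch of a two-moons generator. It is not spkit's implementation: the function name `moons_sketch`, the unit radius, the alternating class assignment and the `seed` argument are all assumptions made for illustration. It only mirrors the documented behaviour that the second class is translated by `d`, both classes are rotated by `angle`, Gaussian noise of standard deviation `s` is added, and with `d = 0` the points lie on a circle.

```python
import math
import random


def moons_sketch(n, s=0.1, d=(0.0, 0.0), angle=0.0, seed=0):
    """Hypothetical two-moons generator (illustration only, not spkit code)."""
    rng = random.Random(seed)
    cos_a, sin_a = math.cos(angle), math.sin(angle)
    X, y = [], []
    for i in range(n):
        label = i % 2                      # alternate between the two classes
        t = rng.uniform(0.0, math.pi)      # position along the half circle
        if label == 0:
            px, py = math.cos(t), math.sin(t)                 # upper half circle
        else:
            px, py = math.cos(t) + d[0], -math.sin(t) + d[1]  # lower half, shifted by d
        # Rotate the point by `angle` (radians) around the origin.
        rx = cos_a * px - sin_a * py
        ry = sin_a * px + cos_a * py
        # Add isotropic Gaussian noise with standard deviation s.
        X.append((rx + rng.gauss(0.0, s), ry + rng.gauss(0.0, s)))
        y.append(label)
    return X, y


X, y = moons_sketch(200, s=0.1, d=(0.5, 0.25), angle=math.pi / 6)
```

With `s=0` and `d=(0, 0)` every generated point sits exactly on the unit circle, which matches the note that `d = 0` places the classes on a circle.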

3. ‘LINEAR’ parameters: :func:`linear`

m: scalar, str, default='random'
- slope of the separating line.
b: scalar, str, default='random'
- bias of the line.
s: float, default=0.1
- standard deviation of the Gaussian noise.
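
The following standard-library sketch illustrates how `m`, `b` and `s` could interact for a linearly separable dataset. It is an assumption-laden illustration, not spkit's implementation: the name `linear_sketch`, the sampling range of [-1, 1] and the `seed` argument are hypothetical. Points are labelled by the side of the line `x1 = m * x0 + b` on which they fall, then perturbed with Gaussian noise of standard deviation `s`.

```python
import random


def linear_sketch(n, m=1.0, b=0.0, s=0.1, seed=0):
    """Hypothetical linear two-class generator (illustration only, not spkit code)."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        x0 = rng.uniform(-1.0, 1.0)
        x1 = rng.uniform(-1.0, 1.0)
        # Class 1 lies above the separating line x1 = m * x0 + b, class 0 below.
        label = 1 if x1 > m * x0 + b else 0
        # Perturb both coordinates with Gaussian noise of standard deviation s.
        X.append((x0 + rng.gauss(0.0, s), x1 + rng.gauss(0.0, s)))
        y.append(label)
    return X, y


X, y = linear_sketch(200, m=0.5, b=0.1, s=0.1)
```

A larger `s` pushes more points across the line after labelling, so the two classes overlap more near the boundary.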

4. ‘SINUSOIDAL’ parameters: :func:`sinusoidal`

5. ‘SPIRAL’ parameters: :func:`spiral`

s: scalar, default=0.5
- standard deviation of the Gaussian noise.
wrappings: scalar, str, default='random'
- number of wrappings of each spiral.
m: scalar, str, default='random'
- multiplier m in x * sin(m * x) for the second spiral.

Returns:

X: 2d-array

y: 1d-array
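
A minimal usage sketch, based only on the signature and parameter lists above. The argument values are arbitrary, the helper `make_dataset` is hypothetical, and the `import spkit as sp` alias is an assumption; the call itself requires spkit to be installed, so it is left commented out.

```python
# Keyword arguments taken from the signature and the 'MOONS' parameters above.
moons_args = dict(N=200, Dtype="MOONS", noise=0.1, s=0.2)


def make_dataset(args):
    """Call spkit.data.create_dataset with the given keyword arguments."""
    # Imported lazily so the snippet can be read without spkit installed.
    import spkit as sp
    return sp.data.create_dataset(**args)


# With spkit installed:
# X, y = make_dataset(moons_args)   # X: 2d-array, y: 1d-array
```

Type-specific options such as `s` are passed through `**kwargs`, alongside the common arguments `N`, `Dtype` and `noise`.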

Examples using `spkit.data.create_dataset`