m (ndarray, shape (2,)) – mean value of the gaussian distribution
sigma (ndarray, shape (2, 2)) – covariance matrix of the gaussian distribution
random_state (int, RandomState instance or None, optional (default=None)) – If int, random_state is the seed used by the random number generator;
If RandomState instance, random_state is the random number generator;
If None, the random number generator is the RandomState instance used
by np.random.
Returns:
X – n samples drawn from \(\mathcal{N}(m, \sigma)\).
p (float) – proportion of one class in the binary setting
random_state (int, RandomState instance or None, optional (default=None)) – If int, random_state is the seed used by the random number generator;
If RandomState instance, random_state is the random number generator;
If None, the random number generator is the RandomState instance used
by np.random.
Returns:
X (ndarray, shape (n, d)) – n observation of size d
p (int) – dimension of the ambient space the data live in
dim ((int,int) or int) – the intrinsic dimensions of the source and target Gaussian HD distriutions. If a single int the intrinsic dimension is assumed to be the same
m_diff (float) – the shift in the first coordinate of the means of the Gaussian HD distributions, i.e. ms_0 and mt_0, respectively (see code)
a ((float, float)) – positive floating numbers corresponding to the isotropic variances in the principal subspace, for the source and target distributions, respectively. The same as delta in [1], Proposition 2.2
b ((float, float)) – positive floating numbers corresponding to the isotropic variance outside the principal subspace for the source and target distributions, respectively.
sub_the_same (bool) – should the source/target Gaussian HD distributions live in the same principal subspace?
Returns:
Xs (ndarray, shape (ns, p)) – ns observations of size p (source)
Xt (ndarray, shape (nt, p)) – nt observations of size p (destination)
pmts (list) – a list containing the parameters of the Gaussian HD distributions
m (ndarray, shape (2,)) – mean value of the gaussian distribution
sigma (ndarray, shape (2, 2)) – covariance matrix of the gaussian distribution
random_state (int, RandomState instance or None, optional (default=None)) – If int, random_state is the seed used by the random number generator;
If RandomState instance, random_state is the random number generator;
If None, the random number generator is the RandomState instance used
by np.random.
Returns:
X – n samples drawn from \(\mathcal{N}(m, \sigma)\).
Return type:
ndarray, shape (n, 2)
ot.datasets.make_data_classif(dataset, n, nz=0.5, theta=0, p=0.5, random_state=None, **kwargs)[source]
Dataset generation for classification problems
Parameters:
dataset (str) – type of classification problem (see code)
p (float) – proportion of one class in the binary setting
random_state (int, RandomState instance or None, optional (default=None)) – If int, random_state is the seed used by the random number generator;
If RandomState instance, random_state is the random number generator;
If None, the random number generator is the RandomState instance used
by np.random.
Returns:
X (ndarray, shape (n, d)) – n observation of size d
p (int) – dimension of the ambient space the data live in
dim ((int,int) or int) – the intrinsic dimensions of the source and target Gaussian HD distriutions. If a single int the intrinsic dimension is assumed to be the same
m_diff (float) – the shift in the first coordinate of the means of the Gaussian HD distributions, i.e. ms_0 and mt_0, respectively (see code)
a ((float, float)) – positive floating numbers corresponding to the isotropic variances in the principal subspace, for the source and target distributions, respectively. The same as delta in [1], Proposition 2.2
b ((float, float)) – positive floating numbers corresponding to the isotropic variance outside the principal subspace for the source and target distributions, respectively.
sub_the_same (bool) – should the source/target Gaussian HD distributions live in the same principal subspace?
Returns:
Xs (ndarray, shape (ns, p)) – ns observations of size p (source)
Xt (ndarray, shape (nt, p)) – nt observations of size p (destination)
pmts (list) – a list containing the parameters of the Gaussian HD distributions