//===========================================================================
/*!
*
*
* \brief The k-means clustering algorithm.
*
*
*
* \author T. Glasmachers
* \date 2011
*
*
* \par Copyright 1995-2017 Shark Development Team
*
*
* This file is part of Shark.
*
*
* Shark is free software: you can redistribute it and/or modify
* it under the terms of the GNU Lesser General Public License as published
* by the Free Software Foundation, either version 3 of the License, or
* (at your option) any later version.
*
* Shark is distributed in the hope that it will be useful,
* but WITHOUT ANY WARRANTY; without even the implied warranty of
* MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
* GNU Lesser General Public License for more details.
*
* You should have received a copy of the GNU Lesser General Public License
* along with Shark. If not, see .
*
*/
//===========================================================================
#ifndef SHARK_ALGORITHMS_APPROXIMATE_KERNEL_EXPANSION_H
#define SHARK_ALGORITHMS_APPROXIMATE_KERNEL_EXPANSION_H
#include
#include
#include
namespace shark{
/// \brief Approximates a kernel expansion by a smaller one using an optimized basis.
///
/// often, a kernel expansion can be represented much more compactly when the points defining the basis of the kernel
/// expansion are not fixed. The resulting kernel expansion can be evaluated much quicker than the original
/// when k is small compared to the number of the nonzero elements in the weight vector of the supplied kernel expansion.
///
/// Given a kernel expansion with weight matrix alpha and Basis B of size m, finds a new weight matrix beta and Basis Z with k vectors
/// so that the difference of the resulting decision vectors is small in the RKHS defined by the supplied kernel.
///
/// The algorithm proceeds by first performing a kMeans clustering as a good initialization. This initial guess is then optimized
/// by finding the closest weight vector to the original vector representable by the basis. Using this estimate, the basis
/// can then be optimized.
///
/// The supplied kernel must be dereferentiable wrt its input parameters which excludes all kernels not defined on RealVector
///
/// The algorithms is O(k^3 + k m) in each iteration.
///
/// \param rng the Rng used for the kMeans clustering
/// \param model the kernel expansion to approximate
/// \param k the number of basis vectors to be used by the approximation
/// \param precision target precision of the gradient to be reached during optimization
SHARK_EXPORT_SYMBOL KernelExpansion approximateKernelExpansion(
random::rng_type& rng,
KernelExpansion const& model,
std::size_t k,
double precision = 1.e-8
);
} // namespace shark
#endif