Publications
Journal articles
- A regularized limited memory BFGS method for large-scale unconstrained optimization and its efficient implementations. Computational Optimization and Applications, 2022. The limited memory BFGS (L-BFGS) method is one of the most popular methods for solving large-scale unconstrained optimization problems. Since the standard L-BFGS method uses a line search to guarantee its global convergence, it sometimes requires a large number of function evaluations. To overcome this difficulty, we propose a new L-BFGS method with a regularization technique and show its global convergence under the usual assumptions. To make the method more robust and efficient, we further extend it with several techniques, such as a nonmonotone strategy and the simultaneous use of a Wolfe line search. Finally, we present numerical results for test problems from CUTEst, which show that the proposed method is robust in the sense that it solves more problems. (A minimal sketch of the underlying L-BFGS direction computation follows this list.)
- A stochastic variance reduced gradient using Barzilai-Borwein techniques as second order information. Journal of Industrial and Management Optimization, 2022. In this paper, we improve the stochastic variance reduced gradient (SVRG) method by incorporating curvature information of the objective function. We propose to reduce the variance of the stochastic gradients using the computationally efficient Barzilai-Borwein (BB) method by incorporating it into SVRG, and we also consider a variant that uses a BB step size. We prove a linear convergence theorem that applies not only to the proposed method but also to other existing variants of SVRG with second-order information. Numerical experiments on benchmark datasets show that the proposed method with a constant step size performs better than existing variance-reduced methods on some test problems. (A sketch of the BB step-size idea also follows this list.)
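For context on the first article, the sketch below shows the classical L-BFGS two-loop recursion used to compute a search direction. It does not reproduce the paper's regularization, nonmonotone strategy, or Wolfe line search; the `mu` parameter only marks where a regularization term could plausibly enter (an assumption for illustration, not the paper's exact formulation).

```python
import numpy as np

def lbfgs_direction(grad, s_list, y_list, mu=0.0):
    """Classical L-BFGS two-loop recursion (illustrative sketch).

    grad   : current gradient (1-D array)
    s_list : recent steps            s_i = x_{i+1} - x_i
    y_list : recent gradient changes y_i = g_{i+1} - g_i
    mu     : optional damping of the initial scaling; a stand-in for where a
             regularization parameter could enter (the paper's actual scheme may differ).
    """
    if not s_list:
        return -grad

    q = grad.copy()
    rhos = [1.0 / np.dot(y, s) for s, y in zip(s_list, y_list)]
    alphas = []

    # Backward pass over stored pairs, most recent first.
    for s, y, rho in zip(reversed(s_list), reversed(y_list), reversed(rhos)):
        alpha = rho * np.dot(s, q)
        alphas.append(alpha)
        q -= alpha * y

    # Initial Hessian approximation H0 = gamma * I, damped by mu.
    s, y = s_list[-1], y_list[-1]
    gamma = np.dot(s, y) / (np.dot(y, y) + mu)
    r = gamma * q

    # Forward pass, oldest pair first.
    for (s, y, rho), alpha in zip(zip(s_list, y_list, rhos), reversed(alphas)):
        beta = rho * np.dot(y, r)
        r += (alpha - beta) * s

    return -r  # quasi-Newton descent direction
```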
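The BB idea in the second article is easiest to see in the scalar step-size form used by SVRG-BB; the following is a generic sketch under that assumption, not the paper's exact algorithm. The function names (`grad_i`, `full_grad`) and the inner-loop length are illustrative.

```python
import numpy as np

def svrg_bb(grad_i, full_grad, x0, n, m=None, eta0=0.1, epochs=20, rng=None):
    """Generic SVRG with a Barzilai-Borwein step size (illustrative sketch).

    grad_i(x, i) : gradient of the i-th component function
    full_grad(x) : full gradient (1/n) * sum_i grad_i(x, i)
    n            : number of component functions
    """
    rng = np.random.default_rng() if rng is None else rng
    m = 2 * n if m is None else m              # inner-loop length
    x_tilde = x0.copy()
    x_prev, g_prev, eta = None, None, eta0

    for _ in range(epochs):
        g = full_grad(x_tilde)                 # snapshot full gradient

        # BB step size from consecutive snapshots:
        #   eta = ||x_k - x_{k-1}||^2 / (m * <x_k - x_{k-1}, g_k - g_{k-1}>)
        if g_prev is not None:
            s, y = x_tilde - x_prev, g - g_prev
            eta = np.dot(s, s) / (m * np.dot(s, y))
        x_prev, g_prev = x_tilde.copy(), g.copy()

        x = x_tilde.copy()
        for _ in range(m):
            i = rng.integers(n)
            # variance-reduced stochastic gradient
            v = grad_i(x, i) - grad_i(x_tilde, i) + g
            x -= eta * v
        x_tilde = x                            # next snapshot
    return x_tilde
```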
Preprints
- Nys-curve: Nyström-approximated curvature for stochastic optimization. arXiv preprint, 2021. Quasi-Newton methods generally provide curvature information by approximating the Hessian using the secant equation. However, the secant equation becomes ineffective at approximating the Newton step because it relies only on first-order derivatives. In this study, we propose an approximate Newton-step-based stochastic optimization algorithm for large-scale empirical risk minimization of convex functions with linear convergence rates. Specifically, we compute a partial column Hessian of size d × k, with k < d randomly selected variables, and then use the Nyström method to better approximate the full Hessian matrix. To further reduce the computational complexity per iteration, we directly compute the update step Δw without computing or storing the full Hessian or its inverse. Furthermore, to address large-scale scenarios in which even computing a partial Hessian may require significant time, we use distribution-preserving (DP) sub-sampling to compute the partial Hessian. DP sub-sampling generates p sub-samples with similar first- and second-order distribution statistics and selects a single sub-sample at each epoch in a round-robin manner to compute the partial Hessian. We integrate our approximated Hessian with stochastic gradient descent and stochastic variance-reduced gradients to solve the logistic regression problem. Numerical experiments show that the proposed approach obtains a better approximation of Newton's method, with performance competitive with state-of-the-art first-order and stochastic quasi-Newton methods. (The core Nyström construction is sketched after this list.)
- Regularized Nyström method for Large-Scale Unconstrained Non-Convex Optimization. ResearchGate preprint, 2024. We propose a quasi-Newton-type regularized Nyström method for solving large-scale unconstrained non-convex optimization problems with high-dimensional feature spaces, focusing on l2-regularized objective functions. To approximate the Hessian efficiently, we employ a regularized Nyström method. However, the Hessian of a non-convex function may have negative eigenvalues. We address this issue by incorporating a regularization parameter that yields a positive definite matrix and a descent direction, resulting in a regularized fixed-rank Nyström approximation. This fixed-rank approximation offers flexibility in rank selection through random column sampling, enhancing computational efficiency. Complementing this, we employ a backtracking line search to determine suitable step sizes, using partial Hessian information to approximate the full Hessian matrix. We establish global convergence under standard assumptions and provide a comprehensive analysis of the global complexity. Finally, we present numerical experiments showing the robustness and efficiency of the proposed method in comparison with existing methods. (See the Nyström sketch after this list.)
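Both preprints revolve around approximating the Hessian from a random subset of its columns via the Nyström method and then regularizing the result to obtain a usable step. The sketch below shows that core construction for a generic twice-differentiable objective; the plain pseudo-inverse, the ridge parameter `rho`, and the explicit d × d solve are simplifying assumptions for illustration, not either paper's exact (matrix-free) update.

```python
import numpy as np

def nystrom_newton_direction(grad, hess_vec, d, k, rho=1e-3, rng=None):
    """Nyström-approximated Newton-type direction (illustrative sketch).

    grad     : gradient at the current iterate (length d)
    hess_vec : function v -> H v (Hessian-vector product at the current iterate)
    k        : number of randomly sampled columns (k < d)
    rho      : regularization keeping the linear system positive definite

    Note: for clarity this forms the d x d approximation explicitly; the
    preprints instead compute the step without storing the full matrix.
    """
    rng = np.random.default_rng() if rng is None else rng
    idx = rng.choice(d, size=k, replace=False)

    # Partial column Hessian C = H[:, idx] via k Hessian-vector products.
    C = np.column_stack([hess_vec(np.eye(d)[:, j]) for j in idx])
    W = C[idx, :]                              # the k x k core block H[idx][:, idx]

    # Fixed-rank Nyström approximation:  H  ≈  C W^+ C^T.
    H_nys = C @ np.linalg.pinv(W) @ C.T

    # Regularize so the system is positive definite, then solve for the step.
    return -np.linalg.solve(H_nys + rho * np.eye(d), grad)
```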
Theses
- Hasse-Minkowski principle for Quadratic forms over Q. Master's thesis, 2016. This master's thesis covers the Hasse–Minkowski theorem, a central theorem in number theory. The theorem states that two quadratic forms over a number field are equivalent if and only if they are equivalent over all completions of the number field. The thesis examines the proof of this theorem for quadratic forms over the rationals. It addresses the local-global principle for equivalence, emphasizing the relationship between local properties at the different places (the real and p-adic completions) and global properties over the rationals. The thesis reinforces our understanding of quadratic forms and illustrates how the Hasse–Minkowski principle fits into algebraic number theory as a whole. (A statement of the theorem is given below.)
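For reference, the theorem studied in the thesis can be stated in its standard form, writing $\mathbb{Q}_p$ for the $p$-adic completions of $\mathbb{Q}$ and $\mathbb{R}$ for the real completion.

```latex
\textbf{Theorem (Hasse--Minkowski).}
A non-degenerate quadratic form $q$ over $\mathbb{Q}$ represents $0$ non-trivially
over $\mathbb{Q}$ if and only if it represents $0$ non-trivially over $\mathbb{R}$
and over $\mathbb{Q}_p$ for every prime $p$.

\textbf{Corollary.}
Two non-degenerate quadratic forms over $\mathbb{Q}$ are equivalent over $\mathbb{Q}$
if and only if they are equivalent over $\mathbb{R}$ and over every $\mathbb{Q}_p$.
```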