Publications


Main publications

2023

  1. ICAART
    Deep Learning Model Selection With Parametric Complexity Control
    Olga Grebenkova, Oleg Bakhteev, Vadim Strijov
    2023

2022

  1. AIST
    Neural architecture search with structure complexity control
    Konstantin Yakovlev, Olga Grebenkova, Oleg Bakhteev, Vadim Strijov
    In Recent Trends in Analysis of Images, Social Networks and Texts, 2022
  2. RFBR
    Digital Education Platform [In Russian]
    Цифровая платформа образования
    Oleg Bakhteev et al.
    Russian Foundation for Basic Research Journal, 2022
  3. AiT
    Gradient methods of metaparameter optimization in knowledge distillation
    Maria Gorphinich, Oleg Bakhteev, Vadim Strijov
    Automation and Remote Control, 2022
  4. ENAI
    Cross-language plagiarism detection: a case study of European universities academic works
    Oleg Bakhteev et al.
    2022
  5. ISP RAS
    Anti-Distillation: Knowledge Transfer from a Simple Model to the Complex One
    Kseniia Petrushina, Oleg Bakhteev, Andrey Grabovoy, Vadim Strijov
    In 2022 Ivannikov Ispras Open Conference (ISPRAS), 2022

2021

  1. IiEP
    Methods of cross-lingual text reuse detection in large textual collections [In Russian]
    Методы обнаружения переводных заимствований в больших текстовых коллекциях
    Rita Kuznetsova, Oleg Bakhteev, Yury Chekhovich
    Informatics and Applications, 2021
  2. IiEP
    Variational deep learning model optimization with complexity control [In Russian]
    Вариационная оптимизация модели глубокого обучения с контролем сложности
    Olga Grebenkova, Oleg Bakhteev, Vadim Strijov
    Informatics and Applications, 2021
  3. Dialogue
    Near-duplicate handwritten document detection without text recognition
    Oleg Bakhteev et al.
    In Computational Linguistics and Intellectual Technologies, 2021
  4. PAN
    Hate Speech Spreader Detection using Contextualized Word Embeddings
    Evgeny Finogeev et al.
    In CLEF (Working Notes), 2021
  5. ISP RAS
    The automatic approach for scientific papers dating
    Andrey Grabovoy, Oleg Bakhteev, Yury Chekhovich
    In Ivannikov Ispras Open Conference (ISPRAS), 2021

2020

  1. IiEP
    Ordering the set of neural network parameters [In Russian]
    Введение отношения порядка на множестве параметров аппроксимирующих моделей
    Andrey Grabovoy, Oleg Bakhteev, Vadim Strijov
    Informatics and Applications, 2020
  2. PAN
    Fake news spreader detection using neural tweet aggregation
    Oleg Bakhteev, Aleksandr Ogaltsov, Petr Ostroukhov
    In CLEF (Working Notes), 2020

2019

  1. IiEP
    Estimation of the relevance of the neural network parameters [In Russian]
    Определение релевантности параметров нейросети
    Andrey Grabovoy, Oleg Bakhteev, Vadim Strijov
    Informatics and Applications, 2019
  2. AOR
    Comprehensive analysis of gradient-based hyperparameter optimization algorithms
    Oleg Bakhteev, Vadim Strijov
    Annals of Operations Research, 2019

2018

  1. AiT
    Deep Learning Model Selection of Suboptimal Complexity
    Oleg Bakhteev, Vadim Strijov
    Automation and Remote Control, 2018
  2. IiEP
    Optimal recurrent neural network model in paraphrase detection [In Russian]
    Выбор оптимальной модели рекуррентной сети в задачах поиска парафраза
    Anton Smerdov, Oleg Bakhteev, Vadim Strijov
    Informatics and Applications, 2018
  3. IiEP
    Automatic metadata extraction from scientific PDF documents [In Russian]
    Автоматическое извлечение метаданных из научных PDF-документов
    Aleksandr Ogaltsov, Oleg Bakhteev
    Informatics and Applications, 2018

2017

  1. PAN
    Author Masking using Sequence-to-Sequence Models
    Oleg Bakhteev, Andrey Khazov
    In CLEF (Working Notes), 2017
  2. A method for artificial and unscientific texts detection in a large document collection [In Russian]
    Об одном методе детектирования искусственных и ненаучных текстов в обширной коллекции документов
    Oleg Bakhteev, Rita Kuznetsova, Alexey Romanov, Yury Chekhovich
    Russian Digital Libraries Journal, 2017
  3. Plagiarism in scientific papers: challenges in translation detection [In Russian]
    Плагиат в научных статьях: трудности обнаружения перевода
    Yury Chekhovich, Rita Kuznetsova, Oleg Bakhteev
    Universitetskaya kniga [University book], 2017

2016

  1. Dialogue
    Machine-translated text detection in a collection of Russian Scientific papers
    Alexey Romanov, Rita Kuznetsova, Oleg Bakhteev, Anton Khritankov
    Dialogue-21, 2016
  2. Systems and means of deep learning classification problems [In Russian]
    Системы и средства глубокого обучения в задачах классификации
    Oleg Bakhteev, Maria Popova, Vadim Strijov
    Sistemy i Sredstva Informatiki [Systems and Means of Informatics], 2016

2015

  1. JMLDA
    Panel matrix and ranking model recovery using mixed-scale measured data
    Oleg Bakhteev
    Journal of machine learning and data analysis, 2015
  2. JMLDA
    Handling missing values in mixed-scale datasetswith large amount of missing values [In Russian]
    Восстановление пропущенных значений в разнородных шкалах с большим числом пропусков
    Oleg Bakhteev
    Journal of machine learning and data analysis, 2015
  3. AINL
    A monolingual approach to detection of text reuse in Russian-English collection
    Oleg Bakhteev, Rita Kuznetsova, Alexey Romanov, Anton Khritankov
    In 2015 Artificial Intelligence and Natural Language and Information Extraction, Social Media and Web Search FRUCT Conference (AINL-ISMW FRUCT), 2015

Talks

2024

  1. Polmeth
    After the enfranchisement: Legislative elites’ reactions to women’s inclusion
    Laurence Brandeberger, Jorge M. Fernandes, Sophia Schlosser, Oleg Bakteev, Luis Salamanca
    2024
    Speaker: Laurence Brandeberger
    California, USA

2023

  1. ICAART
    Deep Learning Model Selection With Parametric Complexity Control
    Olga Grebenkova, Oleg Bakhteev, Vadim Strijov
    2023
    Speaker: Olga Grebenkova
    Online presentation

2022

  1. IDP
    Concordant neural architecture search on multi-domain data [In Russian]
    Поиск согласованных нейросетевых моделей в задаче мультидоменного обучения
    Yakovlev, Konstantin, Bakhteev, Oleg, Strijov, Vadim
    2022
    Speaker: Konstantin Yakovlev
    Moscow
  2. IDP
    Methods of near-duplicate handwritten document search in large collections of texts [In Russian]
    Методы поиска почти-дубликатов рукописных документов в больших коллекциях текстов
    Oleg Bakhteev et al.
    2022
    Speaker: Anrey Grabovoy
    Moscow
  3. ISP
    RuGECToR: rule based error correction model for Russian language [In Russian]
    RuGECToR: нейросетевая модель на основе правил для исправления грамматических ошибок на русском языке
    Ildar Khabutdinov et al.
    2022
    Speaker: Ildar Khabutdinov
    Moscow
  4. ISP
    Anti-Distillation: Knowledge Transfer from a Simple Model to the Complex One [In Russian]
    Антидистилляция: перенос знаний от простой модели к более сложной
    Petrushina, Kseniia, Bakhteev, Oleg, Grabovoy, Andrey abd Strijov, Vadim
    2022
    Speaker: Kseniia Petrushina
    Moscow
  5. ENAI
    Image reuse detection in large-scale document scientific collection
    Oleg Bakhteev et al.
    2022
    Speaker: Mariam Kaprielova
    Online conference

2021

  1. MMPR
    Image reuse detection in large-scale document scientific collection [In Russian]
    Поиск заимствованных изображений в больших коллекциях научных документов
    Evgeny Finogeev et al.
    2021
    Speaker: Evgeny Finogeev
    Online conference
  2. MMPR
    Multi-task Learning in the Problem of Rubrication of Scientific Documents [In Russian]
    Многозадачное обучение в задаче рубрикации научных документов
    Oleg Shevchenko et al.
    2021
    Speaker: Oleg Shevchenko
    Online conference
  3. MMPR
    Model selection using Bayesian hypernetworks [In Russian]
    Порождение моделей заданной сложности с использованием байесовских гиперсетей
    Oleg Grebenkova, Oleg Bakhteev, Vadim Strijov
    2021
    Speaker: Olga Grevenkova
    Online conference
  4. MMPR
    Gradient-based metaparameter optimization in knowledge distillation task [In Russian]
    Градиентные методы оптимизации метапараметров в задаче дистилляции знаний
    Mariya Gorpinich, Oleg Bakhteev, Vadim Strijov
    2021
    Speaker: Mariya Gorpinich
    Online conference
  5. MMPR
    Differentiable architecture search with model complexity control [In Russian]
    Дифференцируемый алгоритм поиска архитектуры с контролем сложности
    Konstantin Yakovlev et al.
    2021
    Speaker: Konstantin Yakovlev
    Online conference
  6. ISP RAS
    The automatic approach for scientific papers dating
    Andrey Grabovoy, Oleg Bakhteev, Yury Chekhovich
    2021
    Speaker: Andrey Grabovoy
    Moscow, Russia
  7. MIPT
    Metaparameter optimization in knowledge distatillation task [In Russian]
    Оптимизация метапараметров в задаче дистилляции знаний
    Mariya Gorpinich, Oleg Bakhteev, Vadim Strijov
    2021
    Speaker: Mariya Gorpinich
    Moscow, Russia
  8. MIPT
    Deep learning neural architecture selection with complexity control [In Russian]
    Выбор архитектуры модели с контролем сложности
    Konstantin Yakovlev et al.
    2021
    Speaker: Konstantin Yakovlev
    Moscow, Russia
  9. Dialogue
    Near-duplicate handwritten document detection without text recognition
    Oleg Bakhteev et al.
    2021
    Speaker: Oleg Bakhteev
    Online conference
  10. ENAI
    Cross-language plagiarism detection: a case study of European universities academic works
    Oleg Bakhteev et al.
    2021
    Speaker: Oleg Bakhteev
    Online conference

2020

  1. MIPT
    Deep learning model generation using Bayesian hypernetworks [In Russian]
    Порождение моделей глубокого обучения с использованием байесовских гиперсетей
    Oleg Grebenkova, Oleg Bakhteev, Vadim Strijov
    2020
    Speaker: Olga Grebenkova
    Moscow, Russia
  2. IDP
    Near-duplicate detection in handwritten school essays [In Russian]
    Поиск почти-дубликатов в рукописных текстах школьных сочинений
    Oleg Bakhteev et al.
    2020
    Speaker: Oleg Bakhteev
    Online conference

2018

  1. IDP
    Bayesian deep learning optimal model structure selection [In Russian]
    Байесовский выбор наиболее правдоподобной структуры модели глубокого обучения
    Oleg Bakhteev
    2018
    Speaker: Oleg Bakhteev
    Gaeta, Italy
  2. IDP
    Estimation of the relevance of the neural network parameters [In Russian]
    Определене релевантности параметров нейросети
    Andrey Grabovoy, Oleg Bakhteev, Vadim Strijov
    2018
    Speaker: Andrey Grabovoy
    Gaeta, Italy

2017

  1. CLEF
    Author Masking using Sequence-to-Sequence Models
    Oleg Bakhteev, Andrey Khazov
    2017
    Speaker: Oleg Bakhteev
    Dublin, Ireland
  2. MMPR
    Gradient-based methods of deep learning hyperparameters optimization [In Russian]
    Градиентные методы оптимизации гиперпараметров моделей глубокого обучения
    Oleg Bakhteev
    2017
    Speaker: Oleg Bakhteev
    Taganrog, Russia
  3. MMPR
    Cross-lingual text reuse detection in scientific papers included in RSCI [In Russian]
    Детектирование переводных заимствований в текстах научных статей из журналов, входящих в РИНЦ
    Oleg Bakhteev, Rita Kuznetsova
    2017
    Speaker: Oleg Bakhteev
    Taganrog, Russia

2015

  1. AINL
    A monolingual approach to detection of text reuse in Russian-English collection
    Oleg Bakhteev, Rita Kuznetsova, Alexey Romanov, Anton Khritankov
    2015
    Speaker: Oleg Bakhteev
    Saint Petersburg, Russia

2014

  1. MIPT
    Panel matrix and ranking model recovery using mixed-scale measured data [In Russian]
    Восстановление панельной матрицы и ранжирующей модели по метризованной выборке в разнородных шкалах
    Oleg Bakhteev
    2014
    Speaker: Oleg Bakhteev
    Moscow, Russia

Workshops and poster sessions

2021

  1. Automated architecture search with model complexity control
    Konstantin Yakovlev et al.
    2021
    Speaker: Konstantin Yakovlev
    ECMLPKDD Workshop on Automating Data Science
  2. Multi-Modeling and Deep Learning Model Selection
    Oleg Bakhteev, Vadim Strijov
    2021
    Speaker: Oleg Bakhteev
    MIPT-UGA AI workshop

2019

  1. NeurIPS
    CrossLang: The System of Cross-lingual Plagiarism Detection
    Oleg Bakhteev, Aleksandr Ogaltsov, Andrey Khazov, Kamil Safin, Rita Kuznetsova
    2019
    NeurIPS, Workshop on Systems for ML
  2. NeurIPS
    CrossLang: The System of Cross-lingual Plagiarism Detection
    Oleg Bakhteev, Aleksandr Ogaltsov, Andrey Khazov, Kamil Safin, Rita Kuznetsova
    2019
    Speaker: Aleksandr Ogaltsov
    NeurIPS, Document Intelligence workshop
  3. KDD
    CrossLang: The System of Cross-lingual Plagiarism Detection
    Oleg Bakhteev, Aleksandr Ogaltsov, Andrey Khazov, Kamil Safin, Rita Kuznetsova
    2019
    Speaker: Kamil Safin
    KDD, Truth Discovery and Fact Checking: Theory and Practice workshop
  4. KDD
    CrossLang: The System of Cross-lingual Plagiarism Detection
    Oleg Bakhteev, Aleksandr Ogaltsov, Andrey Khazov, Kamil Safin, Rita Kuznetsova
    2019
    Speaker: Kamil Safin
    KDD, Workshop on Deep Learning for Education (DL4Ed)

2018

  1. NeurIPS
    Variational Bi-domain Triplet Autoencoder
    Rita Kuznetsova, Oleg Bakhteev
    2018
    Speaker: Rita Kuznetsova
    NeurIPS, Visually Grounded Interaction and Language workshop
  2. NeurIPS
    Variational Bi-domain Triplet Autoencoder
    Rita Kuznetsova, Oleg Bakhteev
    2018
    Speaker: Rita Kuznetsova
    NeurIPS, Visually Grounded Interaction and Language workshop
  3. NeurIPS
    Variational Bi-domain Triplet Autoencoder
    Rita Kuznetsova, Oleg Bakhteev
    2018
    Speaker: Rita Kuznetsova
    NeurIPS, Relational Representation Learning workshop
  4. KDD
    Variational Bi-domain Triplet Autoencoder
    Rita Kuznetsova, Oleg Bakhteev
    2018
    KDD Deep learning day workshop
  5. KDD
    ParaPlagDet: The system of paraphrased plagiarism detection
    Rita Kuznetsova, Oleg Bakhteev, Andrey Khazov, Aleksandr Ogaltsov
    2018
    Speaker: Rita Kuznetsova
    KDD BigScholar workshop
  6. Machine learning methods for fiscal data analysis [In Russian]
    Методы машинного обучения для анализа фискальных данных
    Oleg Bakhteev
    2018
    Workshop at Educational Сenter "Sirius", Russia

2015

  1. RuSSIR
    Explicit Semantic Analysis for Cross-Language Retrieval in Case of Russian-English Translation
    Oleg Bakhteev, Alexey Romanov, Rita Kuznetsova
    2015
    Russian summer school in information retrieval, poster session

Other

2024

  1. Medium
    Godot, HTML5, and Neural Networks: Yet Another Way to Reinvent the Wheel
    Oleg Bakhteev
    2024
    Blog-post
  2. OSF
    Introducing embed2discover: A tool for semi-automated, dictionary-based content-analysis
    Laurence Brandenberger, Oleg Bakhteev, Jorge M. Fernandez, Sophia Schlosser, Luis Salamanca
    2024
  3. Student
    Predictive Models for Motor Pattern Recognition
    Johannes Kjær
    2024
    Teaching assistants at student project at EPFL: Oleg Bakhteev and Leonid Iosipoi. Professor: Guillaume Obozinski
  4. Student
    Representation learning for motor pattern recognition during Inertial Measurement Unit
    Daria Yakovchuk
    2024
    Teaching assistants at student project at EPFL: Oleg Bakhteev and Leonid Iosipoi. Professor: Guillaume Obozinski

2023

  1. Project
    EvolvingDemocraSci: Advancing parliamentary data analysis
    2023
    A project page at Swiss Data science center
  2. Student
    LLM4SciLit - Large Language Models for Information Retrieval in Scientific Literature
    Tommaso Martorella
    2023
    Teaching assistant at student project at EPFL: Oleg Bakhteev. Professor: Guillaume Obozinski
  3. Project
    STIMO: Personalized epidural electrical stimulation of the lumbar spinal cord for clinically applicable therapy to restore mobility after paralyzing spinal cord injury,
    2023
    A project page at Swiss Data science center

2022

  1. Project
    Software for air quality assessment and calculation of atmospheric transport processes [In Russian]
    Программное обеспечение для оценки качества атмосферного воздуха и расчетов процессов переноса в атмосфере
    2022
    A project page at Skoltech
  2. BSc
    Metaparameter optimization in knowledge distillation problem, Bachelor thesis, MIPT [In Russian]
    Оптимизация метапараметров в задаче дистилляции знаний, бакалаврский диплом, МФТИ
    Maria Gorpinich
    2022
    Scientific adviser: Oleg Bakhteev, PhD
  3. BSc
    Concordant neural model selection with complexity control, Bachelor thesis at MIPT [In Russian]
    Выбор согласованных нейросетевых моделей с контролем сложности, бакалаврский диплом, МФТИ
    Konstantin Yakovlev
    2022
    Scientific adviser: Oleg Bakhteev, PhD
  4. MSc
    Bayesian neural architecture model selectionб Master thesis at MIPT [In Russian]
    Байесовский выбор архитектуры нейросетевой модели, диплом магистра, МФТИ
    Anton Sotnikov
    2022
    Scientific adviser: Oleg Bakhteev, PhD
  5. MSc
    An adversarial method for neural network fine-tuning for transfer learning problem, Master thesis at MIPT
    Aleksandr Kolesov
    2022
    Scientific adviser: Oleg Bakhteev, PhD

2021

  1. BSc
    An investigation of gradient-based methods for neural architecture search, Bachelor thesis at MIPT [In Russian]
    Исследование методов поиска структур нейронной сети на основе градиентного поиска, бакалаврский диплом, МФТИ
    Valeria Sherbakova
    2021
    Scientific consultant: Oleg Bakhteev, PhD
  2. BSc
    Model generation with complexity control using Bayesian hypernetworks, Bachelor thesis at MIPT [In Russian]
    Порождение моделей заданной сложности с использованием байесовских гиперсетей, бакалаврский диплом, МФТИ
    Olga Grebenkova
    2021
    Scientific adviser: Oleg Bakhteev, PhD

2020

  1. PhD
    Bayesian suboptimal deep learning structure selection, PhD thesis [In Russian]
    Байесовский выбор субоптимальной структуры модели глубокого обучения, диссертация к.ф.-м. н.
    Oleg Bakhteev
    2020
    Scientific adviser: Vadim Strijov, DSc
  2. arXiv
    Variational learning across domains with triplet information
    Rita Kuznetsova, Oleg Bakhteev, Alexandr Ogaltsov
    2020
  3. Habr
    Klingon language tutorial [In Russian]
    Самоучитель клингонского
    Antiplagiat company
    2020
    It-blog. Article co-author.

2019

  1. Software
    Cross-lingual textual reuse detection module for English-Russian language pair [In Russian]
    Модуль поиска переводных текстовых заимствований с русского на английский язык
    Oleg Bakhteev et al.
    2019
    Software registration certificate

2018

  1. Habr
    How Antiplagiat detects paraphrased text [In Russian]
    «Трое в лодке, нищета и собаки», или как Антиплагиат ищет парафраз
    Antiplagiat company
    2018
    It-blog. Article co-author.
  2. Habr
    An overview of autoencoders application in text analysis [In Russian]
    «Туда и обратно» для нейронных сетей, или обзор применений автокодировщиков в анализе текстов
    Antiplagiat company
    2018
    It-blog. Article co-author.
  3. Habr
    Challenges in translation: how to find a cross-lingual plagiarism from Russian into English [In Russian]
    Трудности перевода: как найти плагиат с английского языка в русских научных статьях
    Antiplagiat company
    2018
    It-blog. Article co-author.

Datasets and supplementary materials

2023

  1. Dataset
    Dataset of handwritten essays, 2021 (reuploaded)
    Oleg Bakhteev et al.
    2023
    Supplementary material for the article "Near-duplicate handwritten document detection without text recognition"

2022

  1. Dataset
    Cross-lingual dataset, 2019 (reuploaded)
    Oleg et al. Bakhteev
    2022
    Supplementary material for the article "CrossLang: the system of cross-lingual plagiarism detection"
  2. Dataset
    Results of cross-lingual text reuse detection among European Universities
    Oleg Bakhteev et al.
    2022
    Supplementary material for the article "Cross-language plagiarism detection: a case study of European languages academic works"
  3. Dataset
    Synthetic dataset for cross-lingual text reuse detection evaluation
    Oleg Bakhteev et al.
    2022
    Supplementary material for the article "Cross-language plagiarism detection: a case study of European languages academic works"

2021

  1. Dataset
    Bibliography dataset (reuploaded)
    Aleksandr Ogaltsov
    2021
    Not my paper! Supplementary material for the article "Language-Free Regular Expression Search of Document’s References"
  2. Dataset
    Open access scientific documents from elibrary.ru
    Andrey Grabovoy, Oleg Bakhteev, Yury Chekhovich
    2021
    Supplementary material for the article "The automatic approach for scientific papers dating"