Objective: to develop a methodology for building a video dataset of reflexive facial mimic activity in a group of transport security (TS) personnel, together with an assessment of their psychophysiological state that confirms the presence of a functional state of fatigue. ...
In this short note we explore what is needed for the unsupervised training of graph language models based on link grammars. First, we introduce the termination tags formalism required to build a language model based on the link grammar formalism of Sleator and Temperley [21] and discuss the influence of context on the unsupervised learning of link grammars. Second, we propose a statistical link grammar formalism that allows for statistical language generation. Third, based on the above formalism, we show that the classical dissertation of Yuret [25] on the discovery of linguistic relations using lexical attraction ignores contextual properties of the language, and thus the approach to unsupervised language learning that relies on bigrams alone is flawed. This is consistent with the unimpressive results in unsupervised training of graph language models based on Yuret's bigram approach. ...
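As a rough illustration of what a bigram-only score looks like, the sketch below computes pointwise mutual information (PMI) over adjacent word pairs in a toy corpus. This is a simplified stand-in chosen for illustration, not the procedure from Yuret's dissertation or from the note itself; the function name `bigram_pmi` and the toy sentence are assumptions.

```python
import math
from collections import Counter


def bigram_pmi(tokens):
    """Return {(left, right): PMI in bits} for adjacent word pairs.

    Illustrative stand-in for a bigram-based attraction score;
    not the exact measure used in the cited work.
    """
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    n_uni = sum(unigrams.values())
    n_bi = sum(bigrams.values())
    scores = {}
    for (left, right), count in bigrams.items():
        p_pair = count / n_bi
        p_left = unigrams[left] / n_uni
        p_right = unigrams[right] / n_uni
        scores[(left, right)] = math.log2(p_pair / (p_left * p_right))
    return scores


# Toy corpus, invented for the example.
tokens = "the cat saw the dog and the dog saw the cat".split()
for pair, score in sorted(bigram_pmi(tokens).items(), key=lambda kv: -kv[1]):
    print(pair, round(score, 2))
```

In this sketch the score of a pair depends only on corpus-wide counts, never on the other words in the sentence, which is one way to picture the context-blindness the abstract criticizes.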
Kiosks are a popular self-service option in many fast-food restaurants: they save time for visitors and labor for the fast-food chains. In this paper, we propose an effective design of a kiosk shopping cart recommender system that combines a language model as a vectorizer and a neural network-based classifier. The model performs better than other models in offline tests and exhibits performance comparable to the best models in A/B/C tests. ...
Video monitoring of patient position in intensive care units is complicated by obstacles covering the patient's body. Conventional posture detection algorithms do not work in this case. Reformulating posture detection for this case as an object detection/image classification problem and using recent deep learning techniques allowed us to achieve 94.5% accuracy on a pre-clinical test classifying 4 postures using imagery from an off-the-shelf camera and edge processing, which is a 60% improvement over the result previously known in the literature. This in turn allowed us to build a system, ready for clinical trials, based on inexpensive off-the-shelf cameras. Clinical Relevance: A cheap and practical system for automatic video monitoring of bedridden patients makes it possible to minimize the risk of pressure ulcers in the ICU. ...
Traditionally, the Central Bank of Russia used regular expressions for payment classification as part of its supervisory activities. Regular expressions often spanned multiple pages to cover the variety of relevant keywords and their forms. We compare this approach to two modern short text classification approaches, fastText and a BERT-based transformer, in terms of speed, accuracy and flexibility, including few-shot learning. ...
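A minimal sketch of what a regular-expression baseline of this kind might look like is given below. The categories, patterns, and the name `classify_payment` are hypothetical and invented for illustration; they are not the rules used by the Central Bank of Russia.

```python
import re

# Hypothetical categories and patterns, invented for illustration only.
RULES = [
    ("salary", re.compile(r"\b(salary|wages?|payroll)\b", re.IGNORECASE)),
    ("tax", re.compile(r"\b(tax(es)?|vat)\b", re.IGNORECASE)),
    ("loan", re.compile(r"\b(loan|credit)\s+(repayment|agreement)\b", re.IGNORECASE)),
]


def classify_payment(purpose, default="other"):
    """Return the label of the first rule whose pattern matches the text."""
    for label, pattern in RULES:
        if pattern.search(purpose):
            return label
    return default


print(classify_payment("Payroll for March"))
print(classify_payment("Office supplies"))
```

Every new keyword or word form has to be added to a pattern by hand, which suggests how such rule sets can grow to the multi-page expressions the abstract mentions.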
This memo describes the NTR-TSU submission for the SIGTYP 2021 Shared Task on predicting language IDs from speech. Spoken Language Identification (LID) is an important step in a multilingual Automated Speech Recognition (ASR) system pipeline. For many low-resource and endangered languages, only single-speaker recordings may be available, creating a need for domain- and speaker-invariant language ID systems. In this memo, we show that a convolutional neural network with a Self-Attentive Pooling layer achieves promising results for the language identification task. ...
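The pooling step mentioned above can be sketched in plain Python. This follows one common formulation of self-attentive pooling (a tanh projection of each frame, a learned context vector, a softmax over time, then a weighted sum); the memo's actual layer may differ, and the parameter names `weight`, `bias`, and `context` and all numeric values are assumptions.

```python
import math


def self_attentive_pooling(frames, weight, bias, context):
    """Pool T frame vectors of size D into a single D-dimensional vector.

    frames:  list of T vectors (e.g. per-time-step CNN outputs)
    weight:  H x D projection matrix; bias: H values
    context: H-dimensional attention vector
    Returns (pooled_vector, attention_weights).
    """
    scores = []
    for frame in frames:
        hidden = [
            math.tanh(sum(w * x for w, x in zip(row, frame)) + b)
            for row, b in zip(weight, bias)
        ]
        scores.append(sum(c * h for c, h in zip(context, hidden)))
    # Numerically stable softmax over time steps.
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    alphas = [e / total for e in exps]
    dim = len(frames[0])
    pooled = [sum(a * f[d] for a, f in zip(alphas, frames)) for d in range(dim)]
    return pooled, alphas


# Toy values, invented for the example.
frames = [[1.0, 2.0], [3.0, 4.0], [0.5, -1.0]]
weight = [[1.0, 0.0], [0.0, 1.0]]
bias = [0.0, 0.0]
context = [0.5, -0.5]
pooled, alphas = self_attentive_pooling(frames, weight, bias, context)
print(pooled, alphas)
```

Because the weights are computed from the frames themselves, the output has a fixed size regardless of utterance length, and with an all-zero context vector the layer reduces to plain mean pooling.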