First practical cases of modernizing the selective editing process with Machine Learning techniques

S. Barragán Andrés, D. Salgado Fernández

Data editing is a fundamental part of the production process in official statistics, since it helps guarantee the quality and accuracy of the data, but it has traditionally consumed a great deal of time and resources. This contribution falls within selective editing, an approach clearly oriented toward resource efficiency that also allows the editing process to be automated, in this case using Machine Learning techniques with a modular, standardized implementation.

Several practical cases applying these techniques have been carried out, covering categorical, continuous, and semicontinuous variables and using both classification and regression models (with random forests and boosting). Specifically, this contribution focuses on the use in production of selective editing for the categorical variable Ocupación (occupation) in the Encuesta Europea de Salud (European Health Survey), where very good results were obtained in terms of resource optimization and quality.
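The prioritization step behind selective editing can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that a fitted classifier (for instance a random forest) has already produced an error probability for each record, and that records are ranked by that probability multiplied by a sampling weight. The names `Record`, `error_probability`, `design_weight`, and `select_for_manual_editing`, as well as the example values, are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Record:
    unit_id: str
    error_probability: float  # assumed output of a fitted classifier, in [0, 1]
    design_weight: float = 1.0  # assumed sampling weight of the unit


def select_for_manual_editing(records, budget):
    """Return the ids of the `budget` records with the highest score.

    The score is the error probability times the design weight, a simple
    stand-in for the expected impact of an error on the final estimate.
    """
    if budget < 0:
        raise ValueError("budget must be non-negative")
    ranked = sorted(
        records,
        key=lambda r: r.error_probability * r.design_weight,
        reverse=True,
    )
    return [r.unit_id for r in ranked[:budget]]


# Illustrative values only; they do not come from any real survey.
records = [
    Record("A", 0.10, 120.0),
    Record("B", 0.85, 40.0),
    Record("C", 0.40, 200.0),
    Record("D", 0.05, 15.0),
]
selected = select_for_manual_editing(records, budget=2)
print(selected)
```

In this sketch only the selected units would go to manual review, while the rest would keep their reported or automatically treated values. The choice of score and of the review budget is an assumption that a real production process would need to calibrate.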

Keywords: official statistical production, selective editing, Machine Learning

Scheduled

XIII Public Statistics Conference. Applications of new statistical methods and new sources for official statistical production (II)

June 10, 2022 10:10 AM

Cloister room

Other papers in the same session

Correcting nonresponse in panel surveys. Application to the “Encuesta social 2018. Educación y transiciones al mercado laboral en Andalucía”

M. Escudero Tena, M. Velasco Fernández-Nieto

Flash estimation of the Harmonized Labour Cost Index with Machine Learning techniques

C. Sáez Calvo, M. Novás Filgueira, L. Sanguiao Sande

Degree of urbanization in Catalonia: administrative divisions versus high-resolution grids

C. Hormigos Feliu, E. Suñé Luis, D. Ibáñez, M. Farré

The perceived quality of official statistics: some evidence from the statistical system of Catalonia

E. Ripoll Font, J. Galter

