Analysis of crystallographic data and construction of predictive models

In crystallography, determining the crystal system of a compound is the first step in the structure solution process. For polycrystalline compounds, this step can become a bottleneck in the practitioner's workflow and often requires manual intervention that presupposes considerable experience. This work proposes a data-driven approach based on Machine Learning (ML) for classifying crystal systems, as an alternative to the traditional one. The data used are X-ray powder diffraction (XRPD) patterns calculated from the CIF (Crystallographic Information File) files in the POW_COD database, which covers organic, inorganic and metal-organic compounds and was developed by the Istituto di Cristallografia of the Consiglio Nazionale delle Ricerche (CNR) in Bari. A preliminary analysis and data reduction were carried out to generate a surrogate dataset that captures the relevant information in each diffraction pattern. The final proposed classifier is based on the Random Forest model and reaches an accuracy of about 60% both on calculated test data and on some real data. Confusion matrices, Precision, Recall and F1-score values, and ROC curves are reported in the results. Although the overall accuracy can be improved, four of the seven classes are highly discriminated, which supports the validity of the proposed methodology.
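The evaluation quantities named in the abstract (confusion matrix, Precision, Recall, F1-score) can be illustrated with a minimal sketch. It is not the thesis code: the helper names `confusion_matrix`, `per_class_metrics` and `accuracy` are hypothetical, and the labels are invented toy values chosen only to show how a modest overall accuracy can coexist with classes that are separated perfectly.

```python
# The seven crystal systems, i.e. the class labels of the classification task.
CRYSTAL_SYSTEMS = [
    "triclinic", "monoclinic", "orthorhombic", "tetragonal",
    "trigonal", "hexagonal", "cubic",
]


def confusion_matrix(y_true, y_pred, labels):
    """Return matrix[i][j] = number of samples of class i predicted as class j."""
    index = {label: k for k, label in enumerate(labels)}
    matrix = [[0] * len(labels) for _ in labels]
    for t, p in zip(y_true, y_pred):
        matrix[index[t]][index[p]] += 1
    return matrix


def per_class_metrics(matrix, labels):
    """Compute precision, recall and F1 for each class from a confusion matrix."""
    metrics = {}
    for k, label in enumerate(labels):
        tp = matrix[k][k]
        predicted = sum(row[k] for row in matrix)  # column sum: predicted as k
        actual = sum(matrix[k])                    # row sum: truly of class k
        precision = tp / predicted if predicted else 0.0
        recall = tp / actual if actual else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        metrics[label] = {"precision": precision, "recall": recall, "f1": f1}
    return metrics


def accuracy(matrix):
    """Fraction of samples on the diagonal of the confusion matrix."""
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[k][k] for k in range(len(matrix)))
    return correct / total if total else 0.0


# Hypothetical toy labels, NOT data from the thesis.
y_true = ["cubic", "cubic", "hexagonal", "monoclinic", "monoclinic", "triclinic"]
y_pred = ["cubic", "cubic", "trigonal", "monoclinic", "orthorhombic", "monoclinic"]

cm = confusion_matrix(y_true, y_pred, CRYSTAL_SYSTEMS)
report = per_class_metrics(cm, CRYSTAL_SYSTEMS)

print(f"accuracy = {accuracy(cm):.2f}")
for label in CRYSTAL_SYSTEMS:
    m = report[label]
    print(f"{label:13s} P={m['precision']:.2f} R={m['recall']:.2f} F1={m['f1']:.2f}")
```

In this toy case only half of the samples are classified correctly, yet the cubic class has Precision, Recall and F1 all equal to 1. This is the kind of per-class view that an overall accuracy figure alone does not show.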

RESEARCH AREAS

Computing

KEYWORDS

machine learning

Reference

University

Università di Bari

Degree programme

Laurea Magistrale (Master's degree) in Data Science

Degree type

Triennale (Bachelor's)

Start date

01/09/2021

End date

10/03/2022
