Please use this identifier to cite or link to this item: https://research.matf.bg.ac.rs/handle/123456789/3184
DC FieldValueLanguage
dc.contributor.authorPanos, Brandonen_US
dc.contributor.authorMilić, Ivanen_US
dc.date.accessioned2026-02-25T08:09:43Z-
dc.date.available2026-02-25T08:09:43Z-
dc.date.issued2026-01-01-
dc.identifier.urihttps://research.matf.bg.ac.rs/handle/123456789/3184-
dc.description.abstractWe present a novel reinforcement learning (RL) approach for solving the classical 2-level atom non-LTE radiative transfer problem by framing it as a control task in which an RL agent learns a depth-dependent source function $S(\tau)$ that self-consistently satisfies the equation of statistical equilibrium (SE). The agent’s policy is optimized entirely via reward-based interactions with a radiative transfer engine, without explicit knowledge of the ground truth. This method bypasses the need for constructing approximate lambda operators ($\Lambda ^{*}$) common in accelerated iterative schemes. Additionally, it requires no extensive precomputed labelled data sets to extract a supervisory signal, and avoids backpropagating gradients through the complex RT solver itself. Finally, we show through experiment that a simple feedforward neural network trained greedily cannot solve for SE, possibly due to the moving target nature of the problem. Our $\Lambda ^{*}-\text{Free}$ method offers potential advantages for complex scenarios (e.g. atmospheres with enhanced velocity fields, multidimensional geometries, or complex microphysics) where $\Lambda ^{*}$ construction or solver differentiability is challenging. Additionally, the agent can be incentivized to find more efficient policies by manipulating the discount factor, leading to a reprioritization of immediate rewards. If demonstrated to generalize past its training data, this RL framework could serve as an alternative or accelerated formalism to achieve SE. To the best of our knowledge, this study represents the first application of reinforcement learning in solar physics that directly solves for a fundamental physical constraint.en_US
dc.language.isoenen_US
dc.publisherOxford University Pressen_US
dc.relation.ispartofRas Techniques and Instrumentsen_US
dc.subjectalgorithmsen_US
dc.subjectnumerical methodsen_US
dc.subjectradiative transfer - machine learning - reinforcement learningen_US
dc.subjectsimulationsen_US
dc.titleSolving the 2-level atom non-LTE problem using soft actor-critic reinforcement learningen_US
dc.typeArticleen_US
dc.identifier.doi10.1093/rasti/rzag005-
dc.identifier.scopus2-s2.0-105029806197-
dc.identifier.isi001681352600001-
dc.identifier.urlhttps://api.elsevier.com/content/abstract/scopus_id/105029806197-
dc.contributor.affiliationAstronomyen_US
dc.relation.issn2752-8200en_US
dc.description.rankM20/M50en_US
dc.relation.firstpageArticle no. rzag005en_US
dc.relation.volume5en_US
item.grantfulltextnone-
item.languageiso639-1en-
item.fulltextNo Fulltext-
item.openairetypeArticle-
item.openairecristypehttp://purl.org/coar/resource_type/c_18cf-
item.cerifentitytypePublications-
crisitem.author.deptAstronomy-
crisitem.author.orcid0000-0002-0189-5550-
Appears in Collections:Research outputs
Show simple item record

Google ScholarTM

Check

Altmetric

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.