vST for Protein Language Models#
References#
This appendix lists references relevant to protein language models, high‑dimensional embedding analysis, scaling laws, structural biology, and validation frameworks. Citations are grouped by category for clarity and presented in a substrate‑agnostic, model‑independent format consistent with the RSM and vST canon.
1. Protein Language Models and Sequence Embeddings#
-
Rives, A., Meier, J., Sercu, T., et al.
Biological Structure and Function Emerge from Scaling Unsupervised Learning to 250 Million Protein Sequences.
PNAS 118, e2016239118 (2021). -
Elnaggar, A., Heinzinger, M., Dallago, C., et al.
ProtTrans: Towards Cracking the Language of Life’s Code Through Self‑Supervised Deep Learning and High Performance Computing.
IEEE TPAMI (2021). -
Rao, R., Liu, J., Verkuil, R., et al.
MSA Transformer.
ICML (2021). -
Madani, A., McCann, B., Naik, N., et al.
ProGen: Language Modeling for Protein Generation.
arXiv:2004.03497 (2020).
2. Structural Biology and Protein Representation#
-
Jumper, J., Evans, R., Pritzel, A., et al.
Highly Accurate Protein Structure Prediction with AlphaFold.
Nature 596, 583–589 (2021). -
Baek, M., DiMaio, F., Anishchenko, I., et al.
Accurate Prediction of Protein Structures and Interactions Using a Three‑Track Neural Network.
Science 373, 871–876 (2021). -
AlQuraishi, M.
End‑to‑End Differentiable Learning of Protein Structure.
Cell Systems 8, 292–301 (2019).
3. High‑Dimensional Modeling and Representation Learning#
-
Bengio, Y., Courville, A., & Vincent, P.
Representation Learning: A Review and New Perspectives.
IEEE TPAMI 35, 1798–1828 (2013). -
Coifman, R. R., & Lafon, S.
Diffusion Maps.
Applied and Computational Harmonic Analysis 21, 5–30 (2006). -
Tenenbaum, J. B., de Silva, V., & Langford, J. C.
A Global Geometric Framework for Nonlinear Dimensionality Reduction.
Science 290, 2319–2323 (2000).
4. Scaling Laws and Model Dynamics#
-
Kaplan, J., McCandlish, S., Henighan, T., et al.
Scaling Laws for Neural Language Models.
arXiv:2001.08361 (2020). -
Hoffmann, J., Borgeaud, S., Mensch, A., et al.
Training Compute‑Optimal Large Language Models.
arXiv:2203.15556 (2022). -
Bahri, Y., Kadmon, J., Pennington, J., et al.
Statistical Mechanics of Deep Learning.
Annual Review of Condensed Matter Physics 11, 501–528 (2020).
5. Regime Behavior, Stability, and Dynamics#
-
Strogatz, S.
Nonlinear Dynamics and Chaos.
Westview Press (2014). -
Ott, E.
Chaos in Dynamical Systems.
Cambridge University Press (2002). -
Guckenheimer, J., & Holmes, P.
Nonlinear Oscillations, Dynamical Systems, and Bifurcations of Vector Fields.
Springer (1983).
6. Validation, Drift Detection, and ML Systems#
-
Breck, E., Cai, S., Nielsen, E., et al.
The ML Test Score: A Rubric for ML Production Readiness and Technical Debt Reduction.
Google Research (2017). -
Sculley, D., Holt, G., Golovin, D., et al.
Hidden Technical Debt in Machine Learning Systems.
NIPS (2015). -
Amershi, S., Begel, A., Bird, C., et al.
Software Engineering for Machine Learning: A Case Study.
ICSE‑SEIP (2019).
7. Substrate‑Level and Triadic‑Frameworks Canon#
-
Loswin, N.
Resonance Substrate Model (RSM): Structural Foundations for High‑Dimensional Inference.
TriadicFrameworks (2025). -
Loswin, N.
Triadic Dimensional Cores: A 3D–9D Substrate for Structural and Inference‑Level Alignment.
TriadicFrameworks (2025). -
Loswin, N.
Validation‑Space‑Time (vST): A Substrate‑Level Framework for Reproducibility and Drift Detection.
TriadicFrameworks (2025). -
Loswin, N.
Dimensional Substrate Structures: Scaling Laws and High‑Dimensional Regimes.
TriadicFrameworks (2026). -
Loswin, N.
vST for Protein Language Models.
TriadicFrameworks (2026).