Invariant Features in Language Models: Geometric Characterization and Model Attribution
Under review
We propose a local geometric framework for identifying invariant semantic subspaces in transformer-based language models, with applications to zero-shot model attribution across diverse LLM architectures.