The rapid evolution of immersive experiences, particularly in Extended Reality (XR) applications like Virtual Reality (VR), Mixed Reality (MR), and Augmented Reality (AR), presents significant challenges for delivering compelling and perceptually accurate spatial audio. Traditional audio formats and rendering techniques fall short of meeting the requirements of these environments, leading to a need for novel approaches to audio representation, encoding, and delivery. Using the Apple Spatial Audio Format (ASAF) and its underlying Apple Positional Audio Codec (APAC) as a case study, we will examine how a unified framework can natively support a diverse array of immersive audio representations, including object-based audio, High Order Ambisonics (HOA), channel-based audio, and binaural rendering.