CoST-UNet: Convolution and swin transformer based deep learning architecture for cardiac segmentation