Implementation details of backpropagation in Siamese networks [D]

Hey folks, could someone please share a correct implementation of backprop in Siamese networks? The explanation in the original paper is not very detailed. I found this implementation on GitHub (ref). The inputs are passed through the network one after the other, the loss is computed for the last two inputs, and the weights are updated afterwards. Is this the correct implementation? Another implementation I could think of is to keep two copies of the same network, as in a bi-encoder.
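
To make the question concrete, here is a minimal sketch of what I understand weight sharing to imply. It is not taken from the paper or from the GitHub code. The one-layer `tanh` encoder, the contrastive loss with a margin, and every function name below are assumptions picked for illustration. With one set of weights `w` used for both inputs, the chain rule gives one gradient term per branch, and the update uses their sum.

```python
import math


def embed(w, x):
    """Shared encoder (assumed for illustration): scalar tanh(w . x)."""
    return math.tanh(sum(wi * xi for wi, xi in zip(w, x)))


def contrastive_loss(w, x1, x2, y, margin=1.0):
    """Assumed loss: y = 1 for a similar pair, y = 0 for a dissimilar pair."""
    d = abs(embed(w, x1) - embed(w, x2))
    return y * 0.5 * d ** 2 + (1 - y) * 0.5 * max(0.0, margin - d) ** 2


def siamese_grad(w, x1, x2, y, margin=1.0):
    """Backprop by hand: both branches use the same w, so their gradients add."""
    e1, e2 = embed(w, x1), embed(w, x2)
    diff = e1 - e2
    d = abs(diff)
    sign = (diff > 0) - (diff < 0)

    # Derivative of the loss with respect to the distance d.
    dL_dd = d if y == 1 else -max(0.0, margin - d)

    # Derivative with respect to each embedding (equal size, opposite sign).
    dL_de1 = dL_dd * sign
    dL_de2 = -dL_de1

    # Backprop through each branch separately; d tanh(z)/dz = 1 - tanh(z)^2.
    g1 = [dL_de1 * (1.0 - e1 ** 2) * xi for xi in x1]
    g2 = [dL_de2 * (1.0 - e2 ** 2) * xi for xi in x2]

    # Shared weights: the total gradient is the sum of the two branch terms.
    return [a + b for a, b in zip(g1, g2)]


def numerical_grad(w, x1, x2, y, eps=1e-6):
    """Central-difference check of the analytic gradient."""
    grads = []
    for i in range(len(w)):
        wp, wm = list(w), list(w)
        wp[i] += eps
        wm[i] -= eps
        delta = contrastive_loss(wp, x1, x2, y) - contrastive_loss(wm, x1, x2, y)
        grads.append(delta / (2 * eps))
    return grads


def sgd_step(w, x1, x2, y, lr=0.1):
    """One weight update after both inputs have been passed through."""
    g = siamese_grad(w, x1, x2, y)
    return [wi - lr * gi for wi, gi in zip(w, g)]
```

If this is right, then passing the two inputs one after the other through a single network and keeping two copies with tied weights should give the same update, as long as the loss is computed on both outputs and gradients flow back through both forward passes. As far as I know, autograd frameworks do this summation automatically when the same module is called twice before a single backward pass.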