In its raw frequency variety, tf is simply the frequency in the "this" for each document. In each document, the word "this" seems at the time; but since the document two has far more words and phrases, its relative frequency is scaled-down.epoch. For that reason a Dataset.batch utilized after Dataset.repeat will produce batches that straddle epoch