For a college project I need to find easy ways to create a dataset of bugs (Deep learning/ machine learning orientated) that includes bug reports from bugzilla, github and relevent posts from stackoverflow and other forums
I found that i can use similarity scoring methods like BM25 yet i am looking for more alternatives
New contributor