I have done similar work in the past and will be able to help you with this. I will align the two data sets and at the same time retain the original data distribution characteristics.
I use Python for Machine Learning. Have worked on most of the Machine Learning Algorithms including Natural Language Processing, Data Wrangling, Data Visualization (Matplotlib, Seaborn and Bokeh), ensemble methods such as Random Forest, Bagging and Boosting and Deep Learning (CNN, Recurrent Neural Networks, Long Short Term Memory etc.,).