Improving speech enhancement algorithms by incorporating visual information