1. Li, M., & Sigal, L. (2021). Referring Transformer: A One-step Approach to Multi-task Visual Grounding. Thirty-Fifth Conference on Neural Information Processing Systems.
    Details
  2. Pang, B., Li, Y., Zhang, Y., Li, M., & Lu, C. (2020). Tubetk: Adopting tubes to track multi-object in a one-step training model. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 6308–6318.
    Details
  3. Xu, X., Li, M., Sun, W., & Yang, M.-H. (2020). Learning spatial and spatio-temporal pixel aggregations for image and video denoising. IEEE Transactions on Image Processing, 29, 7153–7165.
    Details
  4. Pang, B., Li, Y., Li, J., Li, M., Cao, H., & Lu, C. (2021). TDAF: Top-Down Attention Framework for Vision Tasks. Proceedings of the AAAI Conference on Artificial Intelligence, 35(3), 2384–2392.
    Details