morrislee
Sep 24, 2021 · 1 min read
Replace people and objects in street scene images including proper shadows
Repopulating Street Scenes arXiv paper abstract...
52 views · 0 comments
morrislee
Sep 23, 2021 · 1 min read
From image directly output text labels and coordinates of detected objects
Pix2seq: A Language Modeling Framework for Object Detection...
9 views · 0 comments
morrislee
Sep 22, 2021 · 1 min read
Reinforcement learning for learning multi-step tasks on new objects in images
Example-Driven Model-Based Reinforcement Learning for...
3 views · 0 comments
morrislee
Sep 21, 2021 · 1 min read
3DETR transformer for 3D Object Detection
An End-to-End Transformer Model for 3D Object Detection arXiv paper abstract...
16 views · 0 comments
morrislee
Sep 20, 2021 · 1 min read
Real-time face distance and iris track on mobile phone without depth sensor
MediaPipe Iris: Real-time Iris Tracking & Depth Estimation...
30 views · 0 comments
morrislee
Sep 17, 2021 · 1 min read
Better 3D pose estimates in video by dynamically learning joint relationships
Learning Dynamical Human-Joint Affinity for 3D Pose...
39 views · 0 comments
morrislee
Sep 16, 2021 · 1 min read
Unsupervised learning of image classes from dynamic video stream
Online Unsupervised Learning of Visual Representations and Categories...
8 views · 0 comments
morrislee
Sep 15, 2021 · 1 min read
Real-time 3D hand reconstruction from a single monocular image
Towards Accurate Alignment in Real-time 3D Hand-Mesh Reconstruction arXiv...
13 views · 0 comments
morrislee
Sep 14, 2021 · 1 min read
Get depth, regions, and layout from panoramic image quickly and accurately with horizontal features
HoHoNet: 360 Indoor Holistic...
12 views · 0 comments
morrislee
Sep 13, 2021 · 1 min read
Use satellite images to get 3D structure of buildings and roofs
Automated LoD-2 Model Reconstruction from Very-HighResolution...
11 views · 0 comments
morrislee
Sep 10, 2021 · 1 min read
Image classification without normalization that is faster and better than with normalization
High-Performance Large-Scale Image...
14 views · 0 comments
morrislee
Sep 9, 2021 · 1 min read
Image segmentation of objects and regions using transformers
Panoptic SegFormer arXiv paper abstract https://arxiv.org/abs/2109.03814...
9 views · 0 comments
morrislee
Sep 8, 2021 · 1 min read
Using an audio and vision transformer to count crowds
Audio-Visual Transformer Based Crowd Counting arXiv paper abstract...
15 views · 0 comments
morrislee
Sep 7, 2021 · 1 min read
Survey on improving efficiency of computer vision recognition using deep learning
Efficient Visual Recognition with Deep Neural Networks:...
9 views · 0 comments
morrislee
Sep 3, 2021 · 1 min read
Efficiently identify people in image in one stage without separate detection step
Efficient Person Search: An Anchor-Free Approach arXiv...
14 views · 0 comments
morrislee
Sep 2, 2021 · 1 min read
Answer question about an image using text in scene to find external knowledge
External Knowledge Augmented Text Visual Question Answering...
16 views · 0 comments
morrislee
Sep 1, 2021 · 1 min read
Camera looking at blank wall can determine number of people and activity
What You Can Learn by Staring at a Blank Wall arXiv paper...
15 views · 0 comments
morrislee
Aug 31, 2021 · 1 min read
Super-resolution video using non-neighboring frames without frame alignment
Memory-Augmented Non-Local Attention for Video...
16 views · 0 comments
morrislee
Aug 30, 2021 · 1 min read
More accurate action recognition in videos by using the identities of people
Identity-aware Graph Memory Network for Action Detection...
16 views · 0 comments
morrislee
Aug 27, 2021 · 1 min read
Restore image by removing artifacts superimposed in an unknown manner
Blind Image Decomposition arXiv paper abstract...
11 views · 0 comments