unrealcv

Open Source

# UnrealCV [![Join the chat at https://gitter.im/unrealcv/unrealcv](https://badges.gitter.im/unrealcv/unrealcv.svg)](https://gitter.im/unrealcv/unrealcv?utm_source=badge&utm_medium=badge&utm_campaign=pr-badge&utm_content=badge) [![Docs Status](https://readthedocs.org/projects/unrealcv/badge/?version=latest )](http://docs.unrealcv.org)  UnrealCV is a project to help computer vision researchers build virtual worlds using Unreal Engine (UE). It extends UE with a plugin by providing: 1. A set of UnrealCV commands to interact with the virtual world. 2. Communication between UE and an external program, such as Pytorch/Tensorflow. UnrealCV can be used in two ways. - The first one is using a compiled game binary with UnrealCV embedded. This is as simple as running a game, no knowledge of Unreal Engine is required. - The second is installing the UnrealCV plugin into Unreal Engine and using the editor to build a new virtual world. Please read [Tutorial: Getting Started](http://unrealcv.github.io/tutorial/getting_started.html) to learn using UnrealCV. <center> <img src="http://unrealcv.github.io/images/homepage_teaser.png" alt="annotation"/> Images generated from the technical demo <a href="http://docs.unrealcv.org/en/master/reference/model_zoo.html#realisticrendering">RealisticRendering</a><br> </center> ## New Features - Support Unreal Engine 5.6 (recommended). - Optical flow image capture: `vget /camera/[id]/optical_flow [format]`. - Call any Blueprint function from Python by `vbp [obj_name] [func_name] [arg1] [arg2] ...` command. - Support RPC communication between Server and Client in Linux, higher FPS and more reliable. - A set of new commands for camera control and object manipulation, please refer to [command system](https://docs.unrealcv.org/en/latest/reference/commands.html) for more details. ## How to install UnrealCV To install the UnrealCV `Server`, you need: 1. Download the source code and place it on the ``Plugin`` folder of a C++ UE project. 2. Launch the C++ project with Visual Studio, UnrealCV will be compiled at the same time. Note that visual studio version should be compatible with your [UE version](https://dev.epicgames.com/documentation/en-us/unreal-engine/setting-up-visual-studio-development-environment-for-cplusplus-projects-in-unreal-engine). 3. To check the success installation of UnrealCV, you can run ``vget /unrealcv/status`` in the console (Press **`** to display the console). To install the UnrealCV `Client`, just run: ``pip install unrealcv`` > **🚩 Note:** More pre-built UE binaries with UnrealCV can be found in the [UnrealZoo](http://unrealzoo.site/). ## Citation If you found this project useful, please consider citing our paper ```bibtex @article{qiu2017unrealcv, Author = {Weichao Qiu, Fangwei Zhong, Yi Zhang, Siyuan Qiao,Zihao Xiao, Tae Soo Kim, Yizhou Wang, Alan Yuille}, Journal = {ACM Multimedia Open Source Software Competition}, Title = {UnrealCV: Virtual Worlds for Computer Vision}, Year = {2017} } ``` ## Contact If you have any suggestion or interested in using UnrealCV, please [contact us](http://unrealcv.github.io/contact.html).

AI & Machine Learning Game Development

2.2K Github Stars

Open Source

synthetic-computer-vision

# Synthetic for Computer Vision This is a repo for tracking the progress of using synthetic images for computer vision research. If you found any important work is missing or information is not up-to-date, please edit this file directly and make a pull request. Each publication is tagged with a keyword to make it easier to search. If you find anything missing from this page, please edit this `README.md` file to add it. When adding a new item, you can simply follow the format of existing items. How this document is structured is documented in [`contribute.md`](contribute.md). **How to use**: Click publication to jump to the paper title, detailed information such as code and project page will be provided together with pdf file.** <div id="dataset"></div> ## Synthetic image dataset - [SunCG (Princeton)](https://sscnet.cs.princeton.edu/) - [Minos](https://minosworld.github.io/) - [House3d (Facebook)](https://github.com/facebookresearch/House3D) - [Procedural Human Action Videos (PHAV)](#de2016procedural) - [SURREAL](#varol2017learning) - [Virtual KITTI](#gaidon2016virtual) - [Synthia](#ros2016synthia) - [Sintel](#butler2012naturalistic), A synthetic dataset for optical flow - [SceneFlow](#mayer2015large) - [4D Light Fields](#honauer2016dataset) - [ICL-NUIM dataset](#handa2014benchmark) - [Driving in the Matrix](#drivingthematrix) - [Playing for Benchmarks](http://playing-for-benchmarks.org/overview/) <div id="models"></div> ## 3D Model Repository Realistic 3D models are critical for creating realistic and diverse virtual worlds. Here are research efforts for creating 3D model repositories. - [ShapeNet](#chang2015shapenet) - [3dscan](#choi2016large) - [seeing3Dchairs](#aubry2014seeing) <div id="tool"></div> ## Tools - [AIPlayground: UE4 Based Data Ablation tool](#mousavi2020ai), see [project page](https://github.com/MMehdiMousavi/AIP) - [AirSim (Microsoft)](https://github.com/Microsoft/AirSim) - [CARLA (Intel)](https://github.com/carla-simulator/carla) - [Unity ML agents](https://blogs.unity3d.com/2017/09/19/introducing-unity-machine-learning-agents/) - Render SMPL human bodies on Blender, see [CVPR2017](#varol2017learning) - Render for CNN, based on Blender, see [ICCV2015](#su2015render) - [UETorch](https://github.com/facebook/UETorch), based on UE4, see [ICML2016](#lerer2016learning) - [UnrealCV](https://github.com/unrealcv/unrealcv), based on UE4, see [ArXiv](#qiu2016unrealcv) - VizDoom, based on Doom, see [ArXiv](#kempka2016vizdoom) - OpenAI Universe, see [project page](https://universe.openai.com/) - Blender addon for 4D light field rendering, see [project page](https://github.com/lightfield-analysis/blender-addon) - Event-Camera Dataset and Simulator see [project page](https://github.com/uzh-rpg/rpg_davis_simulator) - [NVIDIA Deep learning Dataset Synthesizer (NDDS)](https://github.com/NVIDIA/Dataset_Synthesizer) <div id="resource"></div> ## Resources [ECCV 2016 Workshop Virtual/Augmented Reality for Visual Artificial Intelligence (VARVAI) workshop](http://adas.cvc.uab.es/varvai2016/) [ICCV 2017 Workshop Role of Simulation in Computer Vision](https://www.microsoft.com/en-us/research/event/iccv-2017-role-of-simulation-in-computer-vision/) [Virtual Reality Meets Physical Reality: Modelling and Simulating Virtual Humans and Environments Siggraph Asia 2016 workshop](http://sigvr.org/) [CVPR 2017 Workshop THOR Challenge](http://vuchallenge.org/thor.html) See also: http://riemenschneider.hayko.at/vision/dataset/index.php?filter=+synthetic ## Misc. - RealismCNN [github](https://github.com/junyanz/RealismCNN) - Abnormality Detection in Images(http://paul.rutgers.edu/~babaks/abnormality_detection.html) <div id="reference"></div> ## Reference  ### 2020 <div id="mousavi2020ai"></div> - Mousavi, Mehdi and Khanal, Aashis and Estrada, Rolando. "AI Playground: Unreal Engine-based Data Ablation Tool for Deep Learning" International Symposium on Visual Computing (ISVC), 2020. ([pdf](https://arxiv.org/abs/2007.06153)) ([project](https://github.com/MMehdiMousavi/AIP)) ### 2017 (Total=12) - Adversarially Tuned Scene Generation ([pdf](https://arxiv.org/pdf/1701.00405.pdf)) - UE4Sim: A Photo-Realistic Simulator for Computer Vision Applications ([pdf](https://arxiv.org/abs/1708.05869)) ([project](https://ue4sim.org/)) <div id="richterplaying"></div> - Playing for Benchmarks ([pdf](http://vladlen.info/papers/playing-for-benchmarks.pdf)) <div id="mitash2017self"></div> - A Self-supervised Learning System for Object Detection using Physics Simulation and Multi-view Pose Estimation <span class="octicon octicon-mark-github"></span> ([:octocat:code](https://github.com/cmitash/PHYSIM_6DPose)) ([pdf](https://arxiv.org/pdf/1703.03347.pdf)) ([project](http://paul.rutgers.edu/~cm1074/PHYSIM.html)) <div id="de2016procedural"></div> - Procedural Generation of Videos to Train Deep Action Recognition Networks ([pdf](http://openaccess.thecvf.com/content_cvpr_2017/papers/de_Souza_Procedural_Generation_of_CVPR_2017_paper.pdf)) ([project](http://adas.cvc.uab.es/phav/)) ([citation:8](https://scholar.google.com/scholar?cites=12002008688864745159&as_sdt=2005&sciodt=0,5&hl=en)) <div id="varol2017learning"></div> - Learning from Synthetic Humans <span class="octicon octicon-mark-github"></span> ([:octocat:code](https://github.com/gulvarol/surreal)) ([pdf](https://arxiv.org/abs/1701.01370)) ([project](http://www.di.ens.fr/willow/research/surreal/)) tag: synthetic human - [Nvidia Issac](http://www.marketwired.com/press-release/nvidia-ushers-new-era-robotics-with-breakthroughs-making-it-easier-build-train-intelligent-2215481.htm) - Configurable, Photorealistic Image Rendering and Ground Truth Synthesis by Sampling Stochastic Grammars Representing Indoor Scenes <div id="airsim"></div> - Aerial Informatics and Robotics Platform <span class="octicon octicon-mark-github"></span> ([:octocat:code](https://github.com/Microsoft/AirSim)) ([pdf](https://www.microsoft.com/en-us/research/wp-content/uploads/2017/02/aerial-informatics-robotics-TR.pdf)) ([project](https://www.microsoft.com/en-us/research/project/aerial-informatics-robotics-platform/)) tag: tool <div id="tobin2017domain"></div> - Tobin, Josh, et al. "Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World." arXiv preprint arXiv:1703.06907 (2017). tag: domain ([pdf](https://arxiv.org/pdf/1703.06907.pdf)) <div id="drivingthematrix"></div> - M. Johnson-Roberson, C. Barto, R. Mehta, S. N. Sridhar, Karl Rosaen,and R. Vasudevan, “Driving in the matrix: Can virtual worlds replace human-generated annotations for real world tasks?,” in IEEE International Conference on Robotics and Automation, pp. 1–8, 2017. <span class="octicon octicon-mark-github"></span> ([:octocat:code](https://github.com/umautobots/driving-in-the-matrix)) ([pdf](https://arxiv.org/pdf/1610.01983)) ([project](https://fcav.engin.umich.edu/sim-dataset/)) ([citation:3](https://scholar.google.com/scholar?um=1&ie=UTF-8&lr&cites=2191650018344815319)) <div id="person re-ID"></div> - Zheng Z, Zheng L, Yang Y. "Unlabeled samples generated by gan improve the person re-identification baseline in vitro" in Proceedings of IEEE International Conference on Computer Vision, 2017. <span class="octicon octicon-mark-github"></span> ([:octocat:code](https://github.com/layumi/Person-reID_GAN)) ([pdf](https://arxiv.org/abs/1701.07717)) ([citation:48](https://scholar.google.com/scholar?oi=bibs&hl=zh-CN&cites=270746001988088124)) tag: generated images by GAN ### 2016 (Total=17) <div id="sadeghi2016rl"></div> - Sadeghi, Fereshteh, and Sergey Levine. "rl: Real single-image flight without a single real image. arXiv preprint." arXiv preprint arXiv:1611.04201 12 (2016). tag: rl - Johnson, Justin, et al. "CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning." arXiv preprint arXiv:1612.06890 (2016). ([pdf](https://arxiv.org/abs/1612.06890)) - McCormac, John, et al. "SceneNet RGB-D: 5M Photorealistic Images of Synthetic Indoor Trajectories with Ground Truth." arXiv preprint arXiv:1612.05079 (2016). - de Souza, César Roberto, et al. "Procedural Generation of Videos to Train Deep Action Recognition Networks." arXiv preprint arXiv:1612.00881 (2016). ([pdf](https://arxiv.org/abs/1612.00881)) ([project](http://adas.cvc.uab.es/phav/)) tag: synthetic human - Synnaeve, Gabriel, et al. "TorchCraft: a Library for Machine Learning Research on Real-Time Strategy Games." arXiv preprint arXiv:1611.00625 (2016). ([pdf](https://arxiv.org/abs/1611.00625)) ([code](https://github.com/TorchCraft/TorchCraft)) - Lin, Jenny, et al. "A virtual reality platform for dynamic human-scene interaction." SIGGRAPH ASIA 2016 Virtual Reality meets Physical Reality: Modelling and Simulating Virtual Humans and Environments. ACM, 2016. ([pdf](https://xiaozhuchacha.github.io/projects/siggraphasia16_vrplatform/vrplatform2016siggraphasia.pdf)) ([project](https://xiaozhuchacha.github.io/projects/siggraphasia16_vrplatform/index.html)) - Mahendran, A., et al. "ResearchDoom and CocoDoom: Learning Computer Vision with Games." arXiv preprint arXiv:1610.02431 (2016). ([pdf](https://arxiv.org/pdf/1610.02431.pdf)) ([project](www.robots.ox.ac.uk/~vgg/research/researchdoom/)) <div id="ros2016synthia"></div> - The SYNTHIA dataset: A large collection of synthetic images for semantic segmentation of urban scenes. 2016 ([pdf](http://www.cv-foundation.org/openaccess/content_cvpr_2016/html/Ros_The_SYNTHIA_Dataset_CVPR_2016_paper.html)) ([project](http://synthia-dataset.net/)) ([citation:4](http://scholar.google.com/scholar?cites=9178628328030932213&as_sdt=2005&sciodt=0,5&hl=en)) <div id="gaidon2016virtual"></div> - Virtual Worlds as Proxy for Multi-Object Tracking Analysis. 2016 ([pdf](http://arxiv.org/abs/1605.06457)) ([project](http://www.xrce.xerox.com/Research-Development/Computer-Vision/Proxy-Virtual-Worlds)) ([citation:5](http://scholar.google.com/scholar?cites=11727455440906017188&as_sdt=2005&sciodt=0,5&hl=en)) - Playing for data: Ground truth from computer games. 2016 ([pdf](http://link.springer.com/chapter/10.1007/978-3-319-46475-6_7)) ([citation:1](http://scholar.google.com/scholar?cites=12822958035144353200&as_sdt=2005&sciodt=0,5&hl=en)) - Play and Learn: Using Video Games to Train Computer Vision Models. 2016 ([pdf](http://arxiv.org/abs/1608.01745)) ([citation:1](http://scholar.google.com/scholar?cites=16081073673799361643&as_sdt=2005&sciodt=0,5&hl=en)) - ViZDoom: A Doom-based AI Research Platform for Visual Reinforcement Learning. 2016 ([:octocat:code](https://github.com/Marqt/ViZDoom)) ([pdf](http://arxiv.org/abs/1605.02097)) ([project](http://vizdoom.cs.put.edu.pl/)) ([citation:4](http://scholar.google.com/scholar?cites=4101579648300742816&as_sdt=2005&sciodt=0,5&hl=en)) <div id="choi2016large"></div> - A large dataset of object scans. 2016 ([pdf](http://arxiv.org/abs/1602.02481)) ([project](http://redwood-data.org/3dscan/)) ([citation:6](http://scholar.google.com/scholar?cites=5989950372336055491&as_sdt=2005&sciodt=0,5&hl=en)) <div id="qiu2016unrealcv"></div> - UnrealCV: Connecting Computer Vision to Unreal Engine 2016 <span class="octicon octicon-mark-github"></span> ([:octocat:code](https://github.com/unrealcv/unrealcv)) ([project](http://unrealcv.github.io)) ([pdf](http://arxiv.org/abs/1609.01326)) <div id="lerer2016learning"></div> - Learning Physical Intuition of Block Towers by Example 2016 ([:octocat:code](https://github.com/facebook/UETorch)) ([pdf](http://arxiv.org/abs/1603.01312)) ([citation:12](http://scholar.google.com/scholar?cites=12846348306706460250&as_sdt=2005&sciodt=0,5&hl=en)) - Target-driven Visual Navigation in Indoor Scenes using Deep Reinforcement Learning 2016 ([pdf](http://arxiv.org/abs/1609.05143)) <div id="honauer2016dataset"></div> - A Dataset and Evaluation Methodology for Depth Estimation on 4D Light Fields. ACCV 2016 ([:octocat:code](https://github.com/lightfield-analysis)) ([pdf](http://lightfield-analysis.net/benchmark/paper/lightfield_benchmark_accv_2016.pdf)) ([project](http://lightfield-analysis.net/)) ([citation](https://scholar.google.de/scholar?cluster=3369030498099069181&hl=en&as_sdt=0,5)) ### 2015 (Total=3) - A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation. 2015 ([pdf](http://arxiv.org/abs/1512.02134)) ([citation:9](http://scholar.google.com/scholar?cites=16431759299155441580&as_sdt=2005&sciodt=0,5&hl=en)) <div id="su2015render"></div> - Render for cnn: Viewpoint estimation in images using cnns trained with rendered 3d model views. 2015 ([:octocat:code](https://github.com/ShapeNet/RenderForCNN)) ([pdf](http://www.cv-foundation.org/openaccess/content_iccv_2015/html/Su_Render_for_CNN_ICCV_2015_paper.html)) ([citation:33](http://scholar.google.com/scholar?cites=1209553997502402606&as_sdt=2005&sciodt=0,5&hl=en)) <div id="chang2015shapenet"></div> - Shapenet: An information-rich 3d model repository. 2015 ([pdf](http://arxiv.org/abs/1512.03012)) ([project](http://shapenet.cs.stanford.edu/)) ([citation:27](http://scholar.google.com/scholar?cites=1341601736562194564&as_sdt=2005&sciodt=0,5&hl=en)) ### 2014 (Total=2) - Virtual and real world adaptation for pedestrian detection. 2014 ([pdf](http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=6587038)) ([citation:46](http://scholar.google.com/scholar?cites=2637402509859183337&as_sdt=2005&sciodt=0,5&hl=en)) <div id="aubry2014seeing"></div> - Seeing 3d chairs: exemplar part-based 2d-3d alignment using a large dataset of cad models. 2014 ([:octocat:code](https://github.com/dimatura/seeing3d)) ([pdf](http://www.cv-foundation.org/openaccess/content_cvpr_2014/html/Aubry_Seeing_3D_Chairs_2014_CVPR_paper.html)) ([project](http://www.di.ens.fr/willow/research/seeing3Dchairs/)) ([citation:110](http://scholar.google.com/scholar?cites=18030645502969108287&as_sdt=2005&sciodt=0,5&hl=en)) <div id="handa2014benchmark"></div> - Handa, Ankur, Thomas Whelan, John McDonald, and Andrew J. Davison. "A benchmark for RGB-D visual odometry, 3D reconstruction and SLAM." In Robotics and automation (ICRA), 2014 IEEE international conference on, pp. 1524-1531. IEEE, 2014. ([project](https://www.doc.ic.ac.uk/~ahanda/VaFRIC/iclnuim.html)) ### 2013 (Total=1) - Detailed 3d representations for object recognition and modeling. 2013 ([pdf](http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=6516504)) ([citation:67](http://scholar.google.com/scholar?cites=6595507135181144034&as_sdt=2005&sciodt=0,5&hl=en)) ### 2012 (Total=1) <div id="butler2012naturalistic"></div> - A naturalistic open source movie for optical flow evaluation. 2012 ([pdf](http://link.springer.com/chapter/10.1007/978-3-642-33783-3_44)) ([project](http://sintel.is.tue.mpg.de/)) ([citation:227](http://scholar.google.com/scholar?cites=15124407213489971559&as_sdt=20000005&sciodt=0,21&hl=en)) ### 2010 (Total=1) - Learning appearance in virtual scenarios for pedestrian detection. 2010 ([pdf](http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5540218)) ([citation:79](http://scholar.google.com/scholar?cites=17243485674852907889&as_sdt=2005&sciodt=0,5&hl=en)) ### 2007 (Total=1) - Ovvv: Using virtual worlds to design and evaluate surveillance systems. 2007 ([pdf](http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=4270516)) ([citation:58](http://scholar.google.com/scholar?cites=3459961090644684583&as_sdt=2005&sciodt=0,5&hl=en))

Data Labeling Compliance & Governance

1K Github Stars

unrealcv

Software by unrealcv

unrealcv

synthetic-computer-vision