Kentaro Wada wkentaro

I build tools for computer vision.

labelme ★16k+ - label images for segmentation, detection, and classification
imgviz ★264+ - visualize images and labels without OpenCV or matplotlib
octomap-python ★100+ - Python bindings for the OctoMap 3D mapping library
osam - run SAM1/2/3, EfficientSAM, YOLO-World, and other promptable vision models locally
sam3-onnx - ONNX export and inference for SAM3
yolo-world-onnx - ONNX models for YOLO-World open-vocabulary detection
labelme-satellite-image-demo - annotate satellite imagery with labelme and convert to GeoJSON for QGIS

gdown ★5.3k+ - download Google Drive files that wget and curl choke on
imshow - display images from Python with a customizable viewer
moviepy-cli - edit videos from the command line via MoviePy
jqk - render JSON with jq patterns
git-hunk - non-interactive git hunk staging for AI agents
ihq (wip) - externalize git-ignored files to a synced, identity-derived store
acron (wip) - schedule unattended coding agents on your own server via cron

pytorch-fcn ★1.8k+ - Fully Convolutional Networks in PyTorch
pytorch-for-numpy-users ★705+ - PyTorch reference for NumPy users
morefusion ★238+ - 6D pose estimation from volumetric fusion (CVPR 2020)
fcn ★216+ - Fully Convolutional Networks in Chainer
video-cli ★136+ - command-line tools for quick video editing
gshell ★112+ - navigate Google Drive as you do on a shell
reorientbot ★58+ - learning object reorientation for posed placement (ICRA 2022)
safepicking ★56+ - safe object extraction via object-level mapping (ICRA 2022)

Provide feedback