Looking for the smallest VLM for an NSFW image detector (at least 5 it/s on CPU)
r/LocalLLaMA
Computer Vision
NLP
AI Hardware
Hello everyone, I am looking for a very small VLM or Transformer-based ViT classifier to run inference over images (each under 10 MB, any aspect ratio or resolution). The model only needs to return 1 or 0 for whether the image is NSFW or not, that's it. It must run on CPU only, with no GPU, so I need a very lightweight model. What should I use in this case? What is the current state of things here? Thanks in advance.

submitted by /u/nihalxx3
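To make the two requirements concrete (a 1/0 answer and at least 5 it/s on CPU), here is a minimal sketch of a harness that any candidate model could be dropped into. It is only an illustration: `to_label`, `images_per_second`, and `dummy_score` are hypothetical names, the 0.5 threshold is an assumption, and `dummy_score` is a placeholder that does no real detection. A real model's scoring function would replace it.

```python
import time
from typing import Callable, Iterable


def to_label(nsfw_score: float, threshold: float = 0.5) -> int:
    """Collapse an NSFW probability into 1 (NSFW) or 0 (safe).

    The 0.5 threshold is an assumed default, not a recommended value.
    """
    return 1 if nsfw_score >= threshold else 0


def images_per_second(score_fn: Callable[[bytes], float],
                      images: Iterable[bytes]) -> float:
    """Measure end-to-end throughput of score_fn plus thresholding."""
    batch = list(images)
    if not batch:
        raise ValueError("need at least one image to benchmark")
    start = time.perf_counter()
    for image_bytes in batch:
        to_label(score_fn(image_bytes))
    elapsed = time.perf_counter() - start
    # Guard against a zero reading on very coarse timers.
    return len(batch) / max(elapsed, 1e-9)


def dummy_score(image_bytes: bytes) -> float:
    """Hypothetical stand-in scorer; replace with a real model's inference call."""
    return (len(image_bytes) % 100) / 100.0


if __name__ == "__main__":
    fake_images = [b"\x00" * n for n in (10, 60, 250)]
    rate = images_per_second(dummy_score, fake_images)
    print(f"throughput: {rate:.1f} it/s, target met: {rate >= 5.0}")
```

When benchmarking a real model this way, decoding and resizing should sit inside `score_fn`, since with arbitrary resolutions and files up to 10 MB the preprocessing can take a large share of the per-image time on CPU.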