Revision as of 03:01, 11 September 2020 edit Johanna-Hypatia (talk \| contribs) Extended confirmed users 4,860 edits m typo patrol ← Previous edit		Revision as of 21:17, 7 February 2021 edit undo 88.202.161.165 (talk) →History Next edit →
Line 4: == History == The original goal of R-CNN was to take an input image and produce a set of bounding boxes as output, where ~~the~~ each bounding box contains an object and also the category (e.g. car or pedestrian) of the object. More recently, R-CNN has been extended to perform other computer vision tasks. The following covers some of the versions of R-CNN that have been developed. * November 2013: '''R-CNN'''. Given an input image, R-CNN begins by applying a mechanism called Selective Search to extract [[Region of interest\|regions of interest]] (ROI), where each ROI is a rectangle that may represent the boundary of an object in image. Depending on the scenario, there may be as many as two thousand ROIs. After that, each ROI is fed through a neural network to produce output features. For each ROI's output features, a collection of [[support-vector machine]] classifiers is used to determine what type of object (if any) is contained within the ROI.<ref>{{Cite news\|last=Gandhi\|first=Rohith\|url=https://towardsdatascience.com/r-cnn-fast-r-cnn-faster-r-cnn-yolo-object-detection-algorithms-36d53571365e\|title=R-CNN, Fast R-CNN, Faster R-CNN, YOLO — Object Detection Algorithms\|date=July 9, 2018\|work=Towards Data Science\|access-date=March 12, 2020\|url-status=live}}</ref>

Region Based Convolutional Neural Networks: Difference between revisions