NMS（非极大值抑制）算法详解与示例

seo靠我 2023-09-25 13:07:49

一、NMS是什么？

NMS（non maximum suppression）即非极大值抑制，广泛应用于传统的特征提取和深度学习的目标检测算法中。

NMS原理是通过筛选出局部极大值得到最优解。

在2维边缘提取SEO靠我中体现在提取边缘轮廓后将一些梯度方向变化率较小的点筛选掉，避免造成干扰。

在三维关键点检测中也起到重要作用，筛选掉特征中非局部极值。

在目标检测方面如Yolo和RCNN等模型中均有使用，可以将较小分数的输SEO靠我出框过滤掉,同样，在三维基于点云的目标检测模型中亦有使用。

二、示例

1.opencv示例

查看opencv源码，可以知道canny算子中使用了nms，即对sobel等梯度计算方法生成的梯度矩阵中的点求取局SEO靠我部极大值。

其计算方法是比较中心点与其邻域的梯度值，如果为最大值，则保留，不是的话为0。

源码可见：

Canny算法解析，opencv源码实现及实例//读取图片Mat img = imread("true.SEO靠我jpg");Mat Grayimg;resize(img, img, Size(400, 600), 0, 0, INTER_LINEAR);cvtColor(img, Grayimg, COLOR_SEO靠我RGB2GRAY); //转为灰度图Canny(Grayimg, Grayimg, 100, 300, 3);imshow("picture0", img);imshow("picture", GraSEO靠我yimg);waitKey(0);return 0;

2.PCL示例

点云关键点特征提取算法经常会使用nms提取极大值点。

如3D SIFT关键点检测中需要计算尺度空间中像素点的26邻域的极值点。

算法原理参SEO靠我考：

PCL 3D-SIFT关键点检测(Z方向梯度约束)pcl::SIFTKeypoint<pcl::PointXYZ, pcl::PointWithScale> sift; pcl::SEO靠我PointCloud<pcl::PointWithScale> result; sift.setInputCloud(cloud_xyz); pcl::search::SEO靠我KdTree<pcl::PointXYZ>::Ptr tree(new pcl::search::KdTree<pcl::PointXYZ>()); sift.setSearchMetSEO靠我hod(tree); sift.setScales(0.01f, 7, 20); sift.setMinimumContrast(0.001f); siSEO靠我ft.compute(result);

3.目标检测中nms示例

nms在深度学习领域常用于对box的得分进行极大值筛选，在rcnn，yolo, pointnet等模型中广泛使用。

其算法流程大致为：

1：计SEO靠我算所有box的得分。

2：排序，依次与得分高的box的IOU进行对比，如果大于设定的阈值，就删除该框。

在yolo源代码detect.py可见：pred = non_max_suppression(preSEO靠我d, conf_thres, iou_thres, classes, agnostic_nms, max_det=max_det) conf_thres:置信度即得分score的阈值，SEO靠我yolo为0.25。 iou_thres：重叠度阈值，为0.45 classes：类别数，可以设置保留哪一类的box agnostic_nms:是否去除SEO靠我不同类别之间的框,默认false max_det:一张图片中最大识别种类的个数，默认300 def non_max_suppression(prediction, cSEO靠我onf_thres=0.25, iou_thres=0.45, classes=None, agnostic=False, multi_label=False,labels=(), max_det=3SEO靠我00):"""Runs Non-Maximum Suppression (NMS) on inference resultsReturns:list of detections, on (n,6) tSEO靠我ensor per image [xyxy, conf, cls]"""nc = prediction.shape[2] - 5 # number of classesxc = prediction[SEO靠我..., 4] > conf_thres # candidates# Checksassert 0 <= conf_thres <= 1, fInvalid Confidence threshold SEO靠我{conf_thres}, valid values are between 0.0 and 1.0assert 0 <= iou_thres <= 1, fInvalid IoU {iou_threSEO靠我s}, valid values are between 0.0 and 1.0# Settingsmin_wh, max_wh = 2, 4096 # (pixels) minimum and maSEO靠我ximum box width and heightmax_nms = 30000 # maximum number of boxes into torchvision.ops.nms()time_lSEO靠我imit = 10.0 # seconds to quit afterredundant = True # require redundant detectionsmulti_label &= nc SEO靠我> 1 # multiple labels per box (adds 0.5ms/img)merge = False # use merge-NMSt = time.time()output = [SEO靠我torch.zeros((0, 6), device=prediction.device)] * prediction.shape[0]for xi, x in enumerate(predictioSEO靠我n): # image index, image inference# Apply constraints# x[((x[..., 2:4] < min_wh) | (x[..., 2:4] > maSEO靠我x_wh)).any(1), 4] = 0 # width-heightx = x[xc[xi]] # confidence# Cat apriori labels if autolabellingiSEO靠我f labels and len(labels[xi]):l = labels[xi]v = torch.zeros((len(l), nc + 5), device=x.device)v[:, :4SEO靠我] = l[:, 1:5] # boxv[:, 4] = 1.0 # confv[range(len(l)), l[:, 0].long() + 5] = 1.0 # clsx = torch.catSEO靠我((x, v), 0)# If none remain process next imageif not x.shape[0]:continue# Compute confx[:, 5:] *= x[SEO靠我:, 4:5] # conf = obj_conf * cls_conf# Box (center x, center y, width, height) to (x1, y1, x2, y2)boxSEO靠我 = xywh2xyxy(x[:, :4])# Detections matrix nx6 (xyxy, conf, cls)if multi_label:i, j = (x[:, 5:] > conSEO靠我f_thres).nonzero(as_tuple=False).Tx = torch.cat((box[i], x[i, j + 5, None], j[:, None].float()), 1)eSEO靠我lse: # best class onlyconf, j = x[:, 5:].max(1, keepdim=True)x = torch.cat((box, conf, j.float()), 1SEO靠我)[conf.view(-1) > conf_thres]# Filter by classif classes is not None:x = x[(x[:, 5:6] == torch.tensoSEO靠我r(classes, device=x.device)).any(1)]# Apply finite constraint# if not torch.isfinite(x).all():# x = SEO靠我x[torch.isfinite(x).all(1)]# Check shapen = x.shape[0] # number of boxesif not n: # no boxescontinueSEO靠我elif n > max_nms: # excess boxesx = x[x[:, 4].argsort(descending=True)[:max_nms]] # sort by confidenSEO靠我ce# Batched NMSc = x[:, 5:6] * (0 if agnostic else max_wh) # classesboxes, scores = x[:, :4] + c, x[SEO靠我:, 4] # boxes (offset by class), scoresi = torchvision.ops.nms(boxes, scores, iou_thres) # NMSif i.sSEO靠我hape[0] > max_det: # limit detectionsi = i[:max_det]if merge and (1 < n < 3E3): # Merge NMS (boxes mSEO靠我erged using weighted mean)# update boxes as boxes(i,4) = weights(i,n) * boxes(n,4)iou = box_iou(boxeSEO靠我s[i], boxes) > iou_thres # iou matrixweights = iou * scores[None] # box weightsx[i, :4] = torch.mm(wSEO靠我eights, x[:, :4]).float() / weights.sum(1, keepdim=True) # merged boxesif redundant:i = i[iou.sum(1)SEO靠我 > 1] # require redundancyoutput[xi] = x[i]if (time.time() - t) > time_limit:print(fWARNING: NMS timSEO靠我e limit {time_limit}s exceeded)break # time limit exceededreturn output

“SEO靠我”的新闻页面文章、图片、音频、视频等稿件均为自媒体人、第三方机构发布或转载。如稿件涉及版权等问题，请与我们联系删除或处理，客服邮箱：html5sh@163.com，稿件内容仅为传递更多信息之目的，不代表本网观点，亦不代表本网站赞同其观点或证实其内容的真实性。