修复mysql查询以返回子组内的随机行 [英] fix mysql query to return random row within subgroup

查看:40
本文介绍了修复mysql查询以返回子组内的随机行的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我正在使用以下查询从具有最小时间距离(YEAR 和 MMDD 字段)的记录的子集中随机抽取一行,对于每个 ID1-ID2 对.

I'm using the following query to randomly draw one row from the subset, for each ID1-ID2 pair, of records that have the minimum distance in time (YEAR and MMDD fields).

CREATE TABLE temp4 AS 
     SELECT * 
      FROM temp3 
  GROUP BY ID1, ID2 
  ORDER BY DATEDIFF( CONCAT(YEAR,'-',LEFT(MMDD,2),'-',RIGHT(MMDD,2)), CONCAT(ID3_YEAR,'-',LEFT(ID3_MMDD,2),'-',RIGHT(ID3_MMDD,2)) ) ASC, RAND() 
     LIMIT 0, 1;

从我在这里发布的上一个问题来看,这就是表格的样子

From a previous question I've posted here, this is how the table looks like

ID1 ID2 YEAR  MMDD  ID3 YEAR_ID3  MMDD_ID3
---------------------------------------
1   2   1991  0821  55  1991      0822    
1   2   1991  0821  57  1991      0822    
1   2   1991  0821  88  1992      0101
1   3   1990  0131  89  2000      0202    
1   3   1990  0131  89  2001      0102

对于每个 ID1-ID2 对,我需要选择 ID3

FOR EACH ID1-ID2 pair, I need to select the ID3 with

时间方面的最小距离(YEAR 字段和 MMDD 字段,即我需要比较 YEAR 和 MMDD 与 YEAR_ID3 和 MMDD_ID3)

THE MINIMUM DISTANCE IN TERMS OF TIME (both YEAR field and MMDD field, i.e. I need to compare YEAR and MMDD vs. YEAR_ID3 and MMDD_ID3)

如果有多个 ID3 满足上述最低要求(即它们都具有相同的 YEAR_ID3 和 MMDD_ID3),我需要随机选择一个.

IF MORE THAN ONE ID3 SATISFIES THE MINIMUM REQUIREMENT ABOVE (i.e. they both have the same YEAR_ID3 and MMDD_ID3), I NEED TO SELECT ONE RANDOMLY.

在上面的例子中,查询应该返回

IN THE ABOVE EXAMPLE, THE QUERY SHOULD RETURN

1,2,1991,0821,55 (OR 1,2,1991,0821,57 - ACCORDING TO THE RANDOM DRAW)

1,3,1990,0131,89

我在上面粘贴的那个只返回一行... :(

THE ONE I'VE PASTED ABOVE ONLY RETURNS ONE ROW... :(

在 OMG 发布的评论部分中,不知何故有一个很好的解决方案......但它消失了?!?!?

SOMEHOW THERE WAS A GREAT SOLUTION BELOW IN THE COMMENTS SECTION POSTED BY OMG... but it disappeared?!?!?

我把它贴在这里

DROP TABLE IF EXISTS temp4;
CREATE TABLE temp4 AS
SELECT x.id1,
       x.id2,
       x.YEAR,
       x.MMDD,
       x.id3,
       x.id3_YEAR,
       x.id3_MMDD
 FROM (SELECT t.*,
               ABS(DATEDIFF(CONCAT(CAST(t.id3_YEAR AS CHAR(4)),'-', LEFT(t.id3_MMDD,2),'-',RIGHT(t.id3_MMDD,2)),
                        CONCAT(CAST(t.YEAR AS CHAR(4)),'-', LEFT(t.MMDD,2),'-',RIGHT(t.MMDD,2))))  AS diff,
               CASE 
                 WHEN @id1 = t.id1 AND @id2 = t.id2 THEN @rownum := @rownum + 1
                 ELSE @rownum := 1
               END AS rk,
               @id1 := t.id1,
               @id2 := t.id2
          FROM temp3 t
          JOIN (SELECT @rownum := 0, @id1  := 0, @id2 := 0) r
      ORDER BY t.id1, t.id2, diff, RAND()) x
 WHERE x.rk = 1;

我在这里粘贴了一个测试表的 SQL 转储

I'm pasting here a SQL dump of a test table

DROP TABLE IF EXISTS `temp3`;
CREATE TABLE IF NOT EXISTS `temp3` (
  `id1` char(7) NOT NULL,
  `id2` char(7) NOT NULL,
  `YEAR` year(4) NOT NULL,
  `MMDD` char(4) NOT NULL,
  `id3` char(7) NOT NULL,
  `id3_YEAR` year(4) NOT NULL,
  `id3_MMDD` char(4) NOT NULL
) ENGINE=MyISAM DEFAULT CHARSET=latin1;


INSERT INTO `temp3` VALUES('1', '2', 1992, '0107', '55', 1991, '0528');
INSERT INTO `temp3` VALUES('1', '2', 1992, '0107', '57', 1991, '0701');
INSERT INTO `temp3` VALUES('1', '3', 1992, '0107', '88', 2000, '0101');
INSERT INTO `temp3` VALUES('1', '3', 1992, '0107', '44', 2000, '0101');

推荐答案

从查询中提取随机记录的常用方法是

The usual way to extract a random record(s) from a query is

SELECT [] FROM table ORDER BY RAND() LIMIT []

SELECT [] FROM table ORDER BY RAND() LIMIT []

这篇关于修复mysql查询以返回子组内的随机行的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆