MySQL: EXISTS and IN - Alibaba Cloud Developer Community

2017-11-08


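Reposted from weiyi1314

Conceptually, EXISTS loops over the outer table one row at a time and evaluates the subquery inside EXISTS for each row. If the subquery returns at least one row (no matter how many), the condition is true and the current outer row is kept. If the subquery returns no rows, the current outer row is discarded. The EXISTS condition therefore behaves like a boolean: true when the subquery returns a result set, false when it does not.

For example: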

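select * from user where exists (select 1);

Each row of the user table is fetched in turn. Because the subquery select 1 always returns a row, every row of user is added to the result set, so the query is equivalent to select * from user;

Another example: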

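select * from user where exists (select * from user where userId = 0);

While looping over the user table, the subquery (select * from user where userId = 0) is checked for each row. Assuming no row has userId = 0, the subquery always returns an empty set, the condition is always false, and every row of user is discarded.

NOT EXISTS is the opposite of EXISTS: if the subquery returns a result set, the current row is discarded; otherwise the current row is added to the result set.

In short, if table A has n rows, an EXISTS query conceptually fetches those n rows one at a time and evaluates the EXISTS condition n times.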
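The EXISTS and NOT EXISTS behaviour described above can be checked with a small script. This is only a sketch: it uses Python's built-in sqlite3 module as a stand-in for MySQL (an assumption, chosen so the example runs without a server), and the sample rows are hypothetical.

```python
import sqlite3

# Hypothetical sample data; SQLite stands in for MySQL here.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE user (userId INTEGER PRIMARY KEY, name TEXT)")
conn.executemany(
    "INSERT INTO user VALUES (?, ?)",
    [(1, "zhang"), (2, "wang"), (3, "li")],
)


def ids(sql):
    """Run a query and return the userId values it selects, sorted."""
    return sorted(row[0] for row in conn.execute(sql))


all_rows = ids("SELECT userId FROM user")

# The subquery always returns a row, so every outer row is kept.
always_true = ids("SELECT userId FROM user WHERE EXISTS (SELECT 1)")

# No row has userId = 0, so the subquery is always empty and every row is dropped.
always_false = ids(
    "SELECT userId FROM user WHERE EXISTS (SELECT * FROM user WHERE userId = 0)"
)

# NOT EXISTS inverts the test, so every row is kept again.
negated = ids(
    "SELECT userId FROM user WHERE NOT EXISTS (SELECT * FROM user WHERE userId = 0)"
)

print(all_rows, always_true, always_false, negated)
```

With this data the first, second, and fourth queries select the same rows, and the third selects none.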

An IN query is equivalent to a chain of OR conditions, which is easy to understand. For example, the query

select * from user where userId in (1, 2, 3);

is equivalent to

select * from user where userId = 1 or userId = 2 or userId = 3;

NOT IN is the opposite of IN. For example:

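select * from user where userId not in (1, 2, 3);

is equivalent to

select * from user where userId != 1 and userId != 2 and userId != 3;

In short, an IN query with a subquery conceptually evaluates the subquery first. If its result set B has m rows, that result set is split into m values, and the outer table is then looked up m times, once per value.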
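The equivalence between IN and an OR chain, and between NOT IN and an AND chain of != comparisons, can be verified with a short sketch. It again assumes SQLite (via Python's sqlite3) as a stand-in for MySQL, with hypothetical data.

```python
import sqlite3

# Hypothetical sample data; SQLite stands in for MySQL here.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE user (userId INTEGER)")
conn.executemany("INSERT INTO user VALUES (?)", [(i,) for i in range(1, 7)])


def ids(sql):
    """Run a query and return the userId values it selects, sorted."""
    return sorted(row[0] for row in conn.execute(sql))


in_list = ids("SELECT userId FROM user WHERE userId IN (1, 2, 3)")
or_chain = ids(
    "SELECT userId FROM user WHERE userId = 1 OR userId = 2 OR userId = 3"
)

not_in_list = ids("SELECT userId FROM user WHERE userId NOT IN (1, 2, 3)")
and_chain = ids(
    "SELECT userId FROM user WHERE userId != 1 AND userId != 2 AND userId != 3"
)

print(in_list, or_chain, not_in_list, and_chain)
```

Each pair of queries selects the same rows for this data.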

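Note that when the left-hand side of IN is a single column such as userId, the subquery must return exactly one column. For example, this is valid:

select * from user where userId in (select id from B);

but this is not:

select * from user where userId in (select id, age from B);

EXISTS has no such restriction, because only the existence of rows matters, not the columns they contain.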
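A minimal sketch of the column-count rule follows. It assumes SQLite as a stand-in for MySQL, so the error type shown (sqlite3.OperationalError) is SQLite's; MySQL reports its own error for the same mistake. The tables and rows are hypothetical.

```python
import sqlite3

# Hypothetical sample data; SQLite stands in for MySQL here.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE user (userId INTEGER, age INTEGER)")
conn.execute("CREATE TABLE B (id INTEGER, age INTEGER)")
conn.executemany("INSERT INTO user VALUES (?, ?)", [(1, 20), (2, 30), (3, 40)])
conn.executemany("INSERT INTO B VALUES (?, ?)", [(1, 20), (2, 99)])

# One column on the left, one column from the subquery: accepted.
one_column = sorted(
    row[0]
    for row in conn.execute(
        "SELECT userId FROM user WHERE userId IN (SELECT id FROM B)"
    )
)

# One column on the left, two columns from the subquery: rejected.
try:
    conn.execute("SELECT userId FROM user WHERE userId IN (SELECT id, age FROM B)")
    two_columns_rejected = False
except sqlite3.OperationalError:
    two_columns_rejected = True

# EXISTS only tests whether rows come back, so two columns are fine.
exists_two_columns = sorted(
    row[0]
    for row in conn.execute(
        "SELECT userId FROM user "
        "WHERE EXISTS (SELECT id, age FROM B WHERE B.id = user.userId)"
    )
)

print(one_column, two_columns_rejected, exists_two_columns)
```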

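Now consider the performance of EXISTS and IN.

Consider the following SQL statements:

1: select * from A where exists (select * from B where B.id = A.id);

2: select * from A where A.id in (select id from B);

Query 1 can be expressed as the following pseudocode to make it easier to understand: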

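$result = [];
for ($i = 0; $i < count(A); $i++) {
    $a = get_record(A, $i); # fetch the rows of A one at a time
    # exists_in_B is a hypothetical helper: true if some row of B has B.id = $a['id']
    if (exists_in_B($a['id'])) # the subquery condition holds
        $result[] = $a;
}
return $result;

That is the general idea. As the pseudocode shows, query 1 mainly relies on the index on B.id, and how table A is indexed should have little effect on its efficiency.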

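Assuming the ids in table B are 1, 2, 3, query 2 can be rewritten as

select * from A where A.id = 1 or A.id = 2 or A.id = 3;

This is easy to follow: the query mainly relies on the index on A.id, and how table B is indexed has little effect.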
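The two evaluation strategies described above can be sketched in plain Python. This is a simplified mental model under stated assumptions, not a description of what MySQL's optimizer actually does: the tables are hypothetical in-memory lists, and a set or dict stands in for an index.

```python
# Hypothetical in-memory tables: each row is a dict with an "id" key.
A = [{"id": i, "name": f"a{i}"} for i in range(1, 6)]
B = [{"id": i} for i in (1, 2, 3)]


def exists_strategy(outer, inner):
    """Query 1: loop over the outer table and probe an index on inner.id."""
    inner_index = {row["id"] for row in inner}  # stands in for an index on B.id
    return [row for row in outer if row["id"] in inner_index]


def in_strategy(outer, inner):
    """Query 2: collect the subquery values, then look each one up via an index on outer.id."""
    outer_index = {}
    for row in outer:
        # stands in for an index on A.id
        outer_index.setdefault(row["id"], []).append(row)
    result = []
    for value in {row["id"] for row in inner}:  # distinct ids returned by the subquery
        result.extend(outer_index.get(value, []))
    return result


by_exists = sorted(row["id"] for row in exists_strategy(A, B))
by_in = sorted(row["id"] for row in in_strategy(A, B))
print(by_exists, by_in)
```

Both functions return the same rows; they differ only in which table is looped over and which one is probed, which is the point of the comparison in the text.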

Now look at NOT EXISTS and NOT IN.

1. select * from A where not exists (select * from B where B.id = A.id);

2. select * from A where A.id not in (select id from B);

Query 1 works as before and uses the index on B.

Query 2 can be rewritten as the following statement:

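select * from A where A.id != 1 and A.id != 2 and A.id != 3;

NOT IN is therefore a range query, and a != range query of this kind generally cannot make good use of an index. In effect, for every row of A, table B has to be scanned to check whether that row exists in B.

For this reason, NOT EXISTS is generally more efficient than NOT IN.

Note also that the two queries return the same rows only when B.id contains no NULL values: if the subquery returns a NULL, NOT IN matches no rows at all.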
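Besides performance, NOT IN and NOT EXISTS also differ in how they treat NULL, which follows from SQL's three-valued logic. The sketch below illustrates this with SQLite as a stand-in for MySQL (an assumption) and hypothetical data: once the subquery returns a NULL, NOT IN no longer matches any row, while NOT EXISTS is unaffected.

```python
import sqlite3

# Hypothetical sample data; SQLite stands in for MySQL here.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE A (id INTEGER)")
conn.execute("CREATE TABLE B (id INTEGER)")
conn.executemany("INSERT INTO A VALUES (?)", [(1,), (2,), (3,), (4,)])
conn.executemany("INSERT INTO B VALUES (?)", [(1,), (2,)])

NOT_EXISTS = "SELECT id FROM A WHERE NOT EXISTS (SELECT * FROM B WHERE B.id = A.id)"
NOT_IN = "SELECT id FROM A WHERE A.id NOT IN (SELECT id FROM B)"


def ids(sql):
    """Run a query and return the id values it selects, sorted."""
    return sorted(row[0] for row in conn.execute(sql))


# Without NULLs in B.id the two forms agree.
not_exists_before = ids(NOT_EXISTS)
not_in_before = ids(NOT_IN)

# Add a NULL to B.id and run both queries again.
conn.execute("INSERT INTO B VALUES (NULL)")
not_exists_after = ids(NOT_EXISTS)
not_in_after = ids(NOT_IN)

print(not_exists_before, not_in_before, not_exists_after, not_in_after)
```

If B.id can be NULL, NOT EXISTS (or NOT IN with an explicit `WHERE id IS NOT NULL` in the subquery) is usually what is intended.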

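The usual explanation is that an IN query in MySQL joins the outer and inner tables with a hash join, whereas an EXISTS query loops over the outer table and queries the inner table on each iteration. It has long been said that EXISTS is more efficient than IN, but that is not accurate; it depends on the situation. The actual plan also depends on the MySQL version and its optimizer, so check it with EXPLAIN.

If the two tables are of similar size, IN and EXISTS perform about the same.
If one table is small and the other is large, use EXISTS when the subquery table is the large one and IN when the subquery table is the small one.
For example, with table A (small) and table B (large):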

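1:
select * from A where cc in (select cc from B);
-- less efficient: uses the index on column cc of table A

select * from A where exists (select cc from B where B.cc = A.cc);
-- more efficient: uses the index on column cc of table B

Conversely:

2:
select * from B where cc in (select cc from A);
-- more efficient: uses the index on column cc of table B

select * from B where exists (select cc from A where A.cc = B.cc);
-- less efficient: uses the index on column cc of table A

NOT IN and NOT EXISTS: if the query uses NOT IN, both the inner and the outer table are generally scanned in full and no index is used, whereas the subquery of NOT EXISTS can still use the index on its table. So whichever table is larger, NOT EXISTS is generally faster than NOT IN.

The difference between IN and =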

select name from student where name in ('zhang','wang','li','zhao');

and

select name from student where name='zhang' or name='li' or name='wang' or name='zhao';

return the same result.
