如何（通常）计数重复行（并删除重复项）？

更新时间：2023-01-30 11:22:05

你一直有正确的想法。您必须在每个表格上运行以下内容：

  select table titel，interpret，jahr from table1

您可以将不同的行转储到另一个表中，如下所示：

 创建表table2作为
选择不同的titel，解释，jahr从table1

然后你可以像这样放下这个简单的表：

  drop table table1

将新创建的表重命名为table1，如下所示：

 code> alter table table2重命名为table1

查找表中每条记录的行号：

  select a.rowid，a。* from table1 a

要查找只有重复记录的行号：

  select a.rowid，a。* 
 from table1 a 
 inner join 
（
 select titel，interpret，jahr 
 from table1 
 group by titel，explain，jahr 
 have count（*）> 1 
）b on 
 a.titel = b.titel 
和a.interpret = b.interpret 
和a.jahr = b.jahr

Is there a generic SELECT statement to detect duplicate rows ("identical", where all columns are equal)? E.G, columns 2 & 4 in the following table

 titel                             | interpret        | jahr
-----------------------------------+------------------+-----
 Beauty                            | Ryuichi Sakamoto | 1990
 Goodbye Country (Hello Nightclub) | Groove Armada    | 2001
 Glee                              | Bran Van 3000    | 1997
 Goodbye Country (Hello  Nightclub)| Groove Armada    | 2001

Or do I need a SELECT which is specific to the table?

Someone has given me an Sqlite d/b with multiple tables each of which look like like they have multiple identical rows (with different columns in each table), so I would prefer a generic solution.

After that, I have to figure out how to delete the duplicates. Maybe I can use DISTINCT on the SELECT, store in a temp table, delete the original and rename the temp table?

You had the right idea all along. You will have to run the following on each table:

select distinct titel, interpret, jahr from table1

You could dump the distinct rows into another table like so:

create table table2 as
select distinct titel, interpret, jahr from table1

You can then drop the intial table like so:

drop table table1

Rename the newly created table to table1 like so:

alter table table2 rename to table1

To find row number of each record in the table:

select a.rowid, a.* from table1 a

To find row number of only the records that are duplicate:

select a.rowid, a.* 
from table1 a 
inner join 
(
      select titel, interpret, jahr 
      from table1 
      group by titel, interpret, jahr 
      having count(*) > 1
) b on 
      a.titel = b.titel 
      and a.interpret = b.interpret 
      and a.jahr = b.jahr

上一篇 : ：为任何本地依赖项运行安装后挂钩下一篇 : 春季启动-MySQL设置不起作用

如何（通常）计数重复行（并删除重复项）？

相关阅读

技术问答最新文章