且构网


How to remove duplicate values in a specific column without deleting the related rows

Updated: 2023-02-02 22:20:18


I want to remove duplicate values in a specific column without deleting the rows that contain them, as in the example below:

Input
-----
Date        Market      Quantity
4/2/2018    Indonesia   1000
4/2/2018    Australia   500
4/2/2018    India       300
4/2/2018    USA         500
4/2/2018    Germany     200
5/2/2018    India       400
5/2/2018    Japan       400
5/2/2018    Russia      457
6/2/2018    Austria     260
6/2/2018    Swiss       700
6/2/2018    USA         1200
6/2/2018    Indonesia   400


Output
------
Date        Market      Quantity
4/2/2018    Indonesia   1000
            Australia   500
            India       300
            USA         500
            Germany     200
5/2/2018    India       400
            Japan       400
            Russia      457
6/2/2018    Austria     260
            Swiss       700
            USA         1200
            Indonesia   400

And, if possible, how can I plot a bar/column graph of the same output (something like the sample graph)?

I would add this as a comment, but I don't have the rights yet...

I don't think you actually want to change the data, but as a few people mentioned in the comments, there are easy ways to do that.

If you're just trying to show the multi-dimensional data in plotly and you're not familiar with the library syntax, try the code below...

# Sample data from the question; dates are kept as strings and used only as facet labels
df <- data.frame(
  Date = c('2018/04/02','2018/04/02','2018/04/02','2018/04/02','2018/04/02','2018/05/02','2018/05/02','2018/05/02','2018/06/02','2018/06/02','2018/06/02','2018/06/02'),
  Market = c('Indonesia','Australia','India','USA','Germany','India','Japan','Russia','Austria','Swiss','USA','Indonesia'),
  Quantity = c(1000,500,300,500,200,400,400,457,260,700,1200,400),
  stringsAsFactors = FALSE)

# One facet per date; scales = 'free_x' lets each facet show only its own markets
plotly::ggplotly(
  ggplot2::ggplot(df, ggplot2::aes(x = Market, y = Quantity)) +
    ggplot2::geom_col(ggplot2::aes(fill = Market)) +
    ggplot2::facet_grid(~Date, scales = 'free_x') +
    ggthemes::theme_tufte()
)
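
If you do want the table from the question, with each date shown only once, here is a minimal sketch using base R's duplicated(). It assumes the rows are already grouped by date, as in the sample data. The name display_df is just an illustration, and the blanking is done on a copy so the original df can still be used for the plot.

```r
# Sample data from the question (dates kept as strings, as in the code above)
df <- data.frame(
  Date = rep(c('2018/04/02', '2018/05/02', '2018/06/02'), times = c(5, 3, 4)),
  Market = c('Indonesia', 'Australia', 'India', 'USA', 'Germany', 'India',
             'Japan', 'Russia', 'Austria', 'Swiss', 'USA', 'Indonesia'),
  Quantity = c(1000, 500, 300, 500, 200, 400, 400, 457, 260, 700, 1200, 400),
  stringsAsFactors = FALSE)

# Work on a copy so the original data stays intact for plotting
# (display_df is a hypothetical name, for illustration only)
display_df <- df

# duplicated() is TRUE for every repeat of a value that was already seen,
# so only the first row of each date keeps its label.
# Assumption: rows with the same date are adjacent, as in the sample.
display_df$Date[duplicated(display_df$Date)] <- ''

print(display_df, row.names = FALSE)
```

This only changes how the table is displayed. Keep the full Date column in df for anything that groups or facets by date, since the blanked rows no longer say which date they belong to.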