ChatGPT解决这个技术问题 Extra ChatGPT

Rails: select unique values from a column

I already have a working solution, but I would really like to know why this doesn't work:

ratings = Model.select(:rating).uniq
ratings.each { |r| puts r.rating }

It selects, but don't print unique values, it prints all values, including the duplicates. And it's in the documentation: http://guides.rubyonrails.org/active_record_querying.html#selecting-specific-fields

Another example with uniq stackoverflow.com/questions/8369812/…

S
Simon Fels
Model.select(:rating)

The result of this is a collection of Model objects. Not plain ratings. And from uniq's point of view, they are completely different. You can use this:

Model.select(:rating).map(&:rating).uniq

or this (most efficient):

Model.uniq.pluck(:rating)

Rails 5+

Model.distinct.pluck(:rating)

Update

Apparently, as of rails 5.0.0.1, it works only on "top level" queries, like above. Doesn't work on collection proxies ("has_many" relations, for example).

Address.distinct.pluck(:city) # => ['Moscow']
user.addresses.distinct.pluck(:city) # => ['Moscow', 'Moscow', 'Moscow']

In this case, deduplicate after the query

user.addresses.pluck(:city).uniq # => ['Moscow']

I did a: group(:rating).collect { |r| r.rating } Since map == collect, where can I read about this sintax you used (&:rating)? I don't see this in Ruby's documentation.
@user1261084: see Symbol#to_proc to understand .map(&:rating). PragDave explains
It's worth noting that Model.uniq.pluck(:rating) is the most efficient way of doing this - this generates SQL which use SELECT DISTINCT rather than applying .uniq to an array
In Rails 5, Model.uniq.pluck(:rating) will be Model.distinct.pluck(:rating)
If you want to select unique values from has_many relationship you can always do Model.related_records.group(:some_column).pluck(:some_column)
u
user664833

If you're going to use Model.select, then you might as well just use DISTINCT, as it will return only the unique values. This is better because it means it returns less rows and should be slightly faster than returning a number of rows and then telling Rails to pick the unique values.

Model.select('DISTINCT rating')

Of course, this is provided your database understands the DISTINCT keyword, and most should.


Model.select("DISTINCT rating").map(&:rating) to get an array of just the ratings.
Great for those with legacy apps using Rails 2.3
Yes..this works fantastic - however, but it only returns the DISTINCT attribute. How can you return the entire Model object as long as its distinct? So that you will have access to all of the attributes in the model in instances where the attribute is unique.
@Jackson_Sandland If you want a Model object, that would need to be instantiated from a record in the table. But you're not selecting a record just a unique value (from what may be multiple records).
S
Sergio Tulentsev

This works too.

Model.pluck("DISTINCT rating")

I believe pluck is Ruby 1.9.x and up. Anyone using a previous version won't have it. If you are in 1.9x and above, the ruby docs say this also works: Model.uniq.pluck(:rating)
pluck is a pure Rails > 3.2 method which has no dependency on Ruby 1.9.x See apidock.com/rails/v3.2.1/ActiveRecord/Calculations/pluck
Non-attribute arguments will be disallowed in Rails 6.1, so for those on 6.1+ the following should do the trick: Model.pluck(Arel.sql("DISTINCT rating"))
M
Marcin Nowicki

If you want to also select extra fields:

Model.select('DISTINCT ON (models.ratings) models.ratings, models.id').map { |m| [m.id, m.ratings] }

select extra fields <3 <3
C
Cameron Martin
Model.uniq.pluck(:rating)

# SELECT DISTINCT "models"."rating" FROM "models"

This has the advantages of not using sql strings and not instantiating models


This throws an error with Rails 5.1 / AR 5.1 => undefined method `uniq'
k
kuboon
Model.select(:rating).uniq

This code works as 'DISTINCT' (not as Array#uniq) since rails 3.2


In Rails 6 (At least 6.0.3) this doesn't generate a DISTINCT clause, so it's a SELECT models.rating FROM models and then Array#uniq
S
Sergio Tulentsev
Model.select(:rating).distinct

This is the only officially correct answer that's also super efficient. Although, adding .pluck(:rating) at the end will make it exactly what the OP asked for.
S
Slava Zharkov

Another way to collect uniq columns with sql:

Model.group(:rating).pluck(:rating)

Upvoted for providing an original solution to the problem. This may even be more performant than a DISTINCT clause depending on the DB.
m
marc_s

If I am going right to way then :

Current query

Model.select(:rating)

is returning array of object and you have written query

Model.select(:rating).uniq

uniq is applied on array of object and each object have unique id. uniq is performing its job correctly because each object in array is uniq.

There are many way to select distinct rating :

Model.select('distinct rating').map(&:rating)

or

Model.select('distinct rating').collect(&:rating)

or

Model.select(:rating).map(&:rating).uniq

or

Model.select(:name).collect(&:rating).uniq

One more thing, first and second query : find distinct data by SQL query.

These queries will considered "london" and "london " same means it will neglect to space, that's why it will select 'london' one time in your query result.

Third and forth query:

find data by SQL query and for distinct data applied ruby uniq mehtod. these queries will considered "london" and "london " different, that's why it will select 'london' and 'london ' both in your query result.

please prefer to attached image for more understanding and have a look on "Toured / Awaiting RFP".

https://i.stack.imgur.com/C2nj2.png


map & collect are aliases for the same method, there is no need to provide examples for both.
V
Vassilis

If anyone is looking for the same with Mongoid, that is

Model.distinct(:rating)

this one does not work now, it now returns multiples.
does not return distinct
F
Fernando Fabreti

Some answers don't take into account the OP wants a array of values

Other answers don't work well if your Model has thousands of records

That said, I think a good answer is:

    Model.uniq.select(:ratings).map(&:ratings)
    => "SELECT DISTINCT ratings FROM `models` " 

Because, first you generate a array of Model (with diminished size because of the select), then you extract the only attribute those selected models have (ratings)


S
Simon Fels

You can use the following Gem: active_record_distinct_on

Model.distinct_on(:rating)

Yields the following query:

SELECT DISTINCT ON ( "models"."rating" ) "models".* FROM "models"

A
Afsan Abdulali Gujarati

In my scenario, I wanted a list of distinct names after ordering them by their creation date, applying offset and limit. Basically a combination of ORDER BY, DISTINCT ON

All you need to do is put DISTINCT ON inside the pluck method, like follow

Model.order("name, created_at DESC").offset(0).limit(10).pluck("DISTINCT ON (name) name")

This would return back an array of distinct names.


S
Simon Fels
Model.pluck("DISTINCT column_name")