Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing and fake locations in Wikipedia POI layer #1972

Closed
2 tasks done
zstadler opened this issue Feb 10, 2024 · 3 comments
Closed
2 tasks done

Missing and fake locations in Wikipedia POI layer #1972

zstadler opened this issue Feb 10, 2024 · 3 comments

Comments

@zstadler
Copy link
Member

What happened?

Following a phone discussion and a user bug report

The current algorithm for adding Wikipedia POIs to IHM has a few issues:

  1. It may drop Wikipedia pages that should be shown or merged with OSM POIs.
  2. It may show Wikipedia pages that do not have a location.
  3. It is not scalable

This issue proposes to reverse the search direction. Rather than searching Wikipedia for pages that exists within a given bounding box, the new approach would identify the relevant Wikipedia pages based on OSM tags.

The down-side for such a change is that the IHM POI layer will no longer show Wikipedia pages that do not have an OSM element.

The identification could use the the wikipedia, wikipedia:he, and wikipedia:en tags, as used in the POI merge algorithm, or it can rely solely on wikidata tags, similar to openmaptiles.

Note: The current search-and-match algorithm could be used for periodic bulk OSM edits that will add the missing tag/s.

What steps will reproduce the bug?

The POI layer does not include a Wikipedia POI for זיתן, and the OSM-based POI for זיתן does not have a Wikipedia link

Hebrew Wikipedia has a page for זיתן with a location.
image

The Hebrew Wikipedia API returns that page

{
  "batchcomplete": "",
  "query": {
    "geosearch": [
      {
        "pageid": 44693,
        "ns": 0,
        "title": "זיתן",
        "lat": 31.9756126464499,
        "lon": 34.8905390342779,
        "dist": 22.9,
        "primary": ""
      }
    ]
  }
}

There is also an opposite case where a Wikipedia page that does not have a location is shown as a IHM POI
The Wikipedia page בית הכנסת צלאת בן שאיף does not have a location, it is not returned by the Wikipedia API, but it exists as an IHM POI.

What I expect to happen

Use OSM tags to identify Wikipedia pages

Platform

  • Israel Hiking Map app
  • Israel Hiking Map site in a browser

OS Name and Version

All

Browser Name and Version

All

Additional information

These problems with the contents of the Wikipedia POI layer, may also be related to #1971.

@zstadler zstadler added the bug label Feb 10, 2024
@Tsurha
Copy link

Tsurha commented Mar 2, 2024

Note that many Wikipedia POIs are not exactly at the right coordinates. This is natural for an encyclopedia that is mostly text based and less about exact geo like osm. When merging, my recommendation is to rely on OSM for location.

@HarelM
Copy link
Member

HarelM commented Mar 2, 2024

If the POI exists in OSM and merged to it, the location is from OSM, not Wikipedia.

Copy link

github-actions bot commented Jun 1, 2024

This issue is stale because it has been open 90 days with no activity. Remove stale label or comment or this will be closed in 7 days

@github-actions github-actions bot added the stale label Jun 1, 2024
@github-actions github-actions bot closed this as completed Jun 9, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants