ABSTRACT
CiteSeer is a scientific literature digital library and search engine which automatically crawls and indexes scientific documents in the field of computer and information science. After serving as a public search engine for nearly ten years, CiteSeer is starting to have scaling problems for handling of more documents, adding new feature and more users. Its monolithic architecture design prevents it from effectively making use of new web technologies and providing new services. After analyzing the current system problems, we propose a new architecture and data model, CiteSeerx. CiteSeerx that will overcome the existing problems as well as provide scalability and better performance plus new services and system features.
- C. L. Giles and I. G. Councill. Who gets acknowledged: measuring scientific contributions through automatic acknowledgement indexing. Proceedings of the National Academy of Sciences, 101(51):17599--17604, 2004.Google ScholarCross Ref
- R. Kahn and R. Wilensky. A framework for distributed digital object services. Technical Report, cnri.dlib/tn95-01, 1995.Google Scholar
- S. Lawrence, C. L. Giles, and K. Bollacker. Digital libraries and Autonomous Citation Indexing. IEEE Computer, 32(6):67--71, 1999. Google ScholarDigital Library
- Y. Petinot, C. L. Giles, V. Bhatnagar, P. B. Teregowda, H. Han, and I. Councill. A service-oriented architecture for digital libraries. In ICSOC, pages 263--268, 2004. Google ScholarDigital Library
Index Terms
-
CiteSeerx: an architecture and web service design for an academic document search engine
-
Recommendations
-
CiteSeerX: 20 years of service to scholarly big data
AIDR '19: Proceedings of the Conference on Artificial Intelligence for Data Discovery and ReuseWe overview CiteSeerX, the pioneer digital library search engine, that has been serving academic communities for more than 20 years (first released in 1998), from three perspectives. The system perspective summarizes its architecture evolution in three ...
-
CiteSeerX: AI in a digital library search engine
AAAI'14: Proceedings of the Twenty-Eighth AAAI Conference on Artificial IntelligenceCiteSeerX is a digital library search engine that provides access to more than 4 million academic documents with nearly a million users and millions of hits per day. Artificial intelligence (AI) technologies are used in many components of CiteSeerX e.g. ...
-
Scaling rules in the science system: Influence of field-specific citation characteristics on the impact of individual researchers
The representation of science as a citation density landscape and the study of scaling rules with the field-specific citation density as a main topological property was previously analyzed at the level of research groups. Here, the focus is on the ...
Comments