A power-efficient and scalable load-store queue design
dc.book.title | Integrated circuit and system design: power and timing modeling, optimization and simulation | |
dc.contributor.author | Castro, F. | |
dc.contributor.author | Chaver Martínez, Daniel Ángel | |
dc.contributor.author | Piñuel Moreno, Luis | |
dc.contributor.author | Prieto Matías, Manuel | |
dc.contributor.author | Huang, M. C. | |
dc.contributor.author | Tirado Fernández, José Francisco | |
dc.date.accessioned | 2023-06-20T13:41:50Z | |
dc.date.available | 2023-06-20T13:41:50Z | |
dc.date.issued | 2005 | |
dc.description | © Springer-Verlag Berlin Heidelberg 2005. We want to thank Simha Sethumadhavan for his helpful and thorough comments. International Workshop on Power and Timing Modeling, Optimization and Simulation (15th. sep 21-23, 2005.Lovaina, Belgica). | |
dc.description.abstract | The load-store queue (LQ-SQ) of modem superscalar processors is responsible for keeping the order of memory operations. As the performance gap between processing speed and memory access becomes worse, the capacity requirements for the LQ-SQ increase, and its design becomes a challenge due to its CAM structure. In this paper we propose an efficient load-store queue state filtering mechanism that provides a significant energy reduction (on average 35% in the LSQ and 3.5% in the whole processor), and only incurs a negligible performance loss of less than 0.6%. | |
dc.description.department | Sección Deptal. de Arquitectura de Computadores y Automática (Físicas) | |
dc.description.faculty | Fac. de Ciencias Físicas | |
dc.description.refereed | TRUE | |
dc.description.status | pub | |
dc.eprint.id | https://eprints.ucm.es/id/eprint/29749 | |
dc.identifier.isbn | 3-540-29013-3 | |
dc.identifier.officialurl | http://link.springer.com/chapter/10.1007/11556930_1 | |
dc.identifier.relatedurl | http://link.springer.com | |
dc.identifier.relatedurl | http://www.ece.rochester.edu/~mihuang/PAPERS/patmos05.pdf | |
dc.identifier.uri | https://hdl.handle.net/20.500.14352/53421 | |
dc.language.iso | eng | |
dc.page.final | 9 | |
dc.page.initial | 1 | |
dc.publisher | Springer-Verlag Berlin | |
dc.rights.accessRights | open access | |
dc.subject.cdu | 004 | |
dc.subject.keyword | Computer science | |
dc.subject.keyword | hardware & architecture | |
dc.subject.keyword | theory & methods | |
dc.subject.keyword | Engineering | |
dc.subject.keyword | electrical & electronic | |
dc.subject.ucm | Informática (Informática) | |
dc.subject.ucm | Programación de ordenadores (Informática) | |
dc.subject.unesco | 1203.17 Informática | |
dc.subject.unesco | 1203.23 Lenguajes de Programación | |
dc.title | A power-efficient and scalable load-store queue design | |
dc.type | book part | |
dc.volume.number | 3728 | |
dcterms.references | 1. R. E. Kessler. The Alpha 21264 Microprocessor. Technical Report, Compaq Computer Corporation, 1999. 2. B. Calder and G. Reinman. A Comparative Survey of Load Speculation Architectures. Journal of Instruction-Level Parallelism, May-2000. 3. C. Nairy and D. Soltis. Itanium-2 Processor Microarchitecture. IEEE-Micro, 23(2):44-55, March/April, 2003. 4. J. M. Tendler, J. S. Dodson, J. S. Fields Jr., H. Le and B. Sinharoy. Power-4 System Microarchitecture. IBM Journal of Research and Development, 46(1):5-26, 2002. 5. S. Sethumadhavan, R. Desikan, D. Burger, Charles R. Moore, Stephen W. Keckler. Scalable Hardware Memory Disambiguation for High ILP Processors. Proceedings of MICRO-36, December-2003. 6. T. Austin, E. Larson, and D. Ernst. SimpleScalar: An Infrastructure for Computer System Modeling. Computer, vol. 35, no. 2, Feb 2002. 7. D. Brooks, V. Tiwari, and M. Martonosi. Wattch: A Framework for Architectural-Level Power Analysis and Optimizations. 28-ISCA, Göteborg, Sweden. July, 2001. 8. T. Sherwood, E. Perelman, G. Hamerly, B. Calder. Automatically charecterizing large scale program behavior . Proceedings of ASPLOS-2002, October-2002. 9. S. Sethumadhavan, R. Desikan, D. Burger, Charles R. Moore, Stephen W. Keckler. Scalable Hardware Memory Disambiguation for High ILP Processors. IEEE-Micro, Vol. 24, Issue 6:118-127, November/December, 2004. 10. I. Park, C. Liang Ooi, T. N. Vijaykumar. Reducing design complexity of the load-store queue. Proceedings of MICRO-36, December-2003. 11. H. W. Cain and M. H. Lipasti. Memory Ordering: A Value-Based Approach. Proceedings of ISCA-31, June-2004. 12. A. Roth. A high-bandwidth load-store unit for single- and multi- threaded processors. Technical Report, University of Pennsylvania, 2004. 13. L. Baugh and C. Zilles. Decomposing the Load-Store Queue by Function for Power Reduction and Scalability. Proceedings of PAC Conference, October-2004. | |
dspace.entity.type | Publication | |
relation.isAuthorOfPublication | 6b8b1488-47cc-441e-921b-c1e8042d627c | |
relation.isAuthorOfPublication | 2ce782af-0e05-45eb-b58a-d2efffec6785 | |
relation.isAuthorOfPublication | 5d3f6717-1495-4217-853c-8c9c75d56620 | |
relation.isAuthorOfPublication | 1356616c-9e69-4852-8415-62fd0b8e7cfc | |
relation.isAuthorOfPublication.latestForDiscovery | 6b8b1488-47cc-441e-921b-c1e8042d627c |
Download
Original bundle
1 - 1 of 1